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A NOTE ON THE THEORY OF COMMUNICATION 
THROUGH SOCIAL CHANNELS 


N. RASHEVSKY 


COMMITTEE ON MATHEMATICAL BIOLOGY 
THE UNIVERSITY OF CHICAGO 


In a previous paper (Bull. Math. Biophysics, 12, 215, 1950) a theory 
of transmission of information through a chain of individuals was de- 
veloped. It was assumed that each individual can exhibit only two mu- 
tually exclusive reactions, which serve as signals, thus transmitting to 
his neighbor one bit of information per signal. In the present paper the 
theory is generalized to the case of any number m of mutually exclusive 
reactions, giving thus log,m bits per signal. The general character of 


the process of transmission remains very much the same as before; the 
mathematical expression becoming, however, more complicated. 


In a previous paper (Rashevsky, 1950b; hereinafter referred to 
as loc. cit.) we studied the transmission of communication through 
a chain of individuals in which each individual is capable of only one 
out of two mutually exclusive reactions used as signals. Thus one 
individual can transmit at most one bit of information per signal. We 
shall outline here a generalization of the theory to the case in which 
each individual has the choice of one out of m possible reactions. The 
study of this case requires a generalization of H. D. Landahl’s (1937) 
theory of cross-inhibited parallel chains with fluctuating excitations 
from two to m chains. A direct exact approach to that problem leads 
to great difficulties (Rashevsky, 1950a). Landahl (1941a,b) in his 
theory of learning has, however, suggested an approximate approach, 
which we shall use here. 

With a Laplacian distribution of random fluctuations the ex- 
pression for a wrong response R, to the stimulus S, in the two-stimuli 
case is given by (Rashevsky, 1948, chap. xli): 


P= 4e*ere) (1) 
where the notations are the same as in the above reference. Landahl 


(1941a,b) [cf. also Rashevsky, 1948, chap. xli] suggests for the case 
of m stimuli the following expression for the probability P. of a 


wrong response R;(i # 1) to the stimulus S, 
139 
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ae 1 = 
Pe ee (2) 
m 
where 
— + &g ++:++ + Em 
pos Se eee (3) 
m—I1 


m—A1 
When all the «;’s are equal, P, = ———, as should be the case. 
m 


We shall assume in general for the probability P; of a response 
R, when all e,’s are different, the expression 


P;=1=]P,=1— eklei-e] (4) 
m 
with 
ae ey eo + tee, + Bi bo em 
sD ee ce en ee oe (5) 
Tr. 


Due to the approximate nature of expression (4), > P; is in general 
not equal to 1. It may be possible to normalize expression (4) ap- 
propriately, so as to fulfill the requirement § P;—1. 

If, instead of a Laplacian distribution of the fluctuations, a nor- 
mal distribution is used, then we have, for the case of two stimuli 
(e, > e.) (Landau, 1950): 


p.=o(2—*), (6) 


1 a v2 
G| “|= ~ e= dx. (7) 
( V2ne iP 20° 


In line with Landahl’s generalization, we shall assume the following 
expression: 
Let «,, denote the root of the equation 


Km 1 
@( = Fe (8) 
o m 


The quantity «,, is always negative for m > 2. Then for the prob- 
ability P; of response R; put 


where 
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Km 1 8; — 6) 
or) (9) 


o 


a 1 
When all e’s are equal to (e; = &’) we have P; = —, as should be the 
m 


case. As e; tends to o , P; tends tol. 

Let the e;’s Perresent permanent excitations at the correspond- 
ing connections. They are present in the absence of any external 
stimuli S;. If such external stimuli are present, the corresponding 
é; 1s increased by an amount proportional to S; (neglecting peripheral 
thresholds). With a proper choice of units we now have, instead of 
é; , the quantity e; + S;. Hence equation (9) becomes 


Km ae &j se Sj (e ate S®) 


a  e (10) 
C 
with 
Sa la Ss. 57 see Sha + Say3 +4... Ss 
SOS A es (11) 
m—1 


Expression (10) is of course only a rough approximation. 

As in loc. cit. we shall consider that the sight of a reaction RF; 
performed by another individual acts as a stimulus S; on the given 
individual. We again limit ourselves to the rather artificial assump- 
tion that all reactions, and hence all stimuli, are of the same inten- 
sity S. If now in a chain of individuals, the (n — 1)st individual 
makes a reaction R; , this reaction acts as a stimulus S; of intensity 
S on the nth individual. Since we assume that only one out of m re- 
actions can be produced at a time, the nth individual is subject to a 
stimulus S; = S, while all other S;’s (k #7) are equal to zero. The 
probability of his producing the reaction R;, as a result of this is, 
therefore, given, according to (10), by 


mtextS—e , 
Bie (=) (12) 
Cc 


On the other hand, the probability Pi, that the nth individual will 
produce the reaction R; (k # i), when the (n — 1)th produces R;, 


is given by 
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Ss 
Km + & — e®) ——— 
m—1 


Co 
because now, according to (11), 


a S 
5s =—, (14) 
ne 
If all ¢,’s are equal, so that there is no natural bias in favor of one 
or another reaction, we have, instead of (12) and (18), 


Ka ta 
Co 
and 
S 
Nh oo ee 
m—1 
pues) A (16) 
Cc 


In this case all Pix.’s (4 # k) are equal, as should physically be the 
case. If m is large and S is large, then P;; is close to 1, while Piz is 
very small (Fig. 1). 


mel 
Se a Fm O 2 
FIGURE 1 
For not too large values of S we may derive the following ap- 


proximate expressions by approximating the curve of Fig. 1 by a 
straight line 


1 S 
has ; —oVn/2 oVn/2 
o sfaee Va/2<S <ovVn/2 (17) 


whose ordinate is equal to 1/2 for S=0. Then, from (15) and (16) 
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Ll «mt S 
P;,=-+ ae (18) 
2. V2nG 
and 
Ss 
iL m—1 
ee ee. (19) 
2 \V/ 21 
Hence 
Ss m 
A=Pi,— P= See age a (20) 
V2n2e0m—1 


In general different individuals in the chain will have different 
biases «;’s. Hence their P;;’s and Pix’s will be different. We shall 
denote by Pi;(n) and Pi,(n) the corresponding values for the nth in- 
dividual. Denote by p(n) the probability of reaction R, by the nth 
individual. Since the possible reactions of the (n — 1)st individual 
act as stimuli on the nth individual, we have: 

px (N) =p, (n— 1) Py. (1) cz P2(n — 1) Pox (1) 
+ s++* Dm (1 —1) Px (”). (21) 
Since 


3 p.(n—1) =1, (22) 


therefore, putting 

Pix(n) — Px (n) = Aix (%) 5 

Px (n) = B;(n) (23) 
we may write expression (21) in the following way: 


y(n) =p,(n—1)Ay.(n) + Po(n—1)Ax(M) +--+ 


+ Dna (NM —1)Amai(m) + Be(n). (24) 
Introducing matrix notations we have: 

p(n) =p(n—1)A(n) + B(n). (25) 

This recursion formula is satisfied formally by the matrix expression 


B(k) 
p(n) =p(0)A(1) A(2) fa ao (26) 
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in which I denotes the unity matrix. When the expressions P;; and 
P;, (k # 7%) are independent of n, then all the square matrices A, 
are equal to A, and all the row matrices B, are equal to B, and ex- 
pression (26) reduces to 

I— A” 


p(n) = p(0)A" + B 7 


(27) 


We may now, as in loc cit., ask the question as to what is the 
probability of the nth individual producing any reaction R,, if the 
zeroth individual produces a given reaction R;, which, without loss 
of generality, may be put as R,. In this case the row matrix p(0) is 
simply equal to 100----. 


For purposes of obtaining simple closed expressions even the 
restricted expression (27) is still rather inconvenient. We may, how- 
ever, draw some important conclusions from it. 

As in equation (46) of loc. cit., we may interpret the first term 
of the right side of (27) as due to the transmitted information, while 
the second term may be interpreted as due to channel noise. Equa- 
tion (44) of loc cit. shows that for m = 2 the probability of receiving 
a signal different from that given at the input is determined only by 
the channel noise. In the present more general case of m > 2 this 
is not so. If we denote by (A”) i, the ikth element of the matrix A” 
and remember that p,(0) =—1, ,(0) #0; (r #1), then we find from 
equation (27): 


1— A" 
p, (m) =D, (0) (A) a + (2 ; (28) 
I—A ], 

Equation (28) shows that p,(0) contributes in general to any 
pr(n),7r #1. In other words the occurrence of any reaction R,, dif- 
ferent from FR, , is due in general to transmission as well as to chan- 
nel noise. 

If, at the input end, R, is always produced, so that p,(0) = 1, 
and, if we are interested in the probability p,(n) of R, at the output 
end, we see that the part of p,(n) which is due to transmission is 
given by 


p(n) eas (A”) 11s (29) 


The element (A”),, of the matrix A is an nth degree polynomial 
in the Aj,’s. Since Ai, < 1, we see that the transmitted information 
decreases with 7 , tending to zero as n tends to infinity. 
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Because of the formal analogy of equation (27) and of equation 
(44) of loc. cit. we can derive in a similar way as before the expres- 
sion for the a posteriori probability of R, at the input, when R, is 
received at the output. We find now, using notations similar to those 
used in loc. cit., 


~ Pen + MYin 


Pn (30) 


V wp nv 


Hence as 7 increases, p, tends to 1/m. 

A rather interesting case to study would be the following. Sup- 
pose that all possible reactions R, can be arranged linearly in order 
of their similarity. The similarity may be either a phonetic or a se- 
mantic one. The determination of the similarity may be made by 
standard psychometric methods. The following situation is quite 
plausible. If the (x — 1)st individual makes a reaction R; , then the 
probability that the mth individual will make a reaction R; (1 # 7) 
will be the greater, the smaller the similarity between R; and R,; 
that is, the greater the distance between R; and R, in their linear 
arrangement. A neurological mechanism, providing for this, can be 
readily suggested. Mathematically we may express the situation by 
taking the matrix P in expression (21) to be symmetric, and by put- 
ting 


Pi, = Pri = U] aly ase (31) 


The rows and columns of the matrix A now become geometric 
series, the ratios of which are powers of e. It may perhaps be 
possible to find at least approximate closed expressions for p;,(m) in 
that case and study the effect of 1 on the decrement in transmission. 

The author is indebted to Dr. George Karreman for a careful 
check of this paper. 
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A NOTE ON IMITATIVE BEHAVIOR AND INFORMATION 


N. RASHEVSKY 
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The equations for imitative behavior developed previously indicate 
that the imitation effect increases with the number N, of individuals in 


the social group. In this note it is pointed out that the above conclusion 
holds only for not too large values of N,. The above conclusion is based 
on the tacit assumption that each individual is completely informed about 
the behavior adopted by every other individual. If, however, N, becomes 


very large, the information per individual decreases, As a result of this, 
the effects of imitation either increase with N, less rapidly or actually 


become independent of N,. 


The basic equation of imitative behavior, as developed previously 
(Rashevsky, 1951a), with the same notations as used before is: 
dy 
—=A(X-Y) — ay. (1) 
dt 
The quantity X—Y, which is the difference between the number X 
of all individuals who exhibit behavior R, and the number Y of all 
individuals who exhibit behavior R, , acts as a stimulus to imitate be- 
havior R,. Since 


Xo Y= Noy (2) 


the largest possible value of X — Y is N.. With increasing yw the 
quantity X — Y tends asymptotically to N., and, if all other para- 
meters are kept constant, the value of X — Y for any particular y is 
proportional to N,. 
Equation (1) holds, however, only with the tacit assumption 
that each individual actually knows the amounts X and Y. For rela- 
tively small values of N, this may be the case. As Np increases, an 1n- 
dividual is likely to have complete information only about those other 
individuals who are in his immediate neighborhood. Generalizing the 
concepts used in loc. cit., considering X and Y now as functions of the 
coordinates w’,y’, and introducing a function of the distance 


r=V@—«#) + yy) 
147 
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between the given individual located at «,y and an individual at a a 
we obtain an integrodifferential equation (Rashevsky, 1950a). This 
equation, as it has been shown, leads to a nonuniform distribution 
of behavior. 

In this note we shall consider a different effect. We shall assume 
that the proportion of individuals exhibiting the two behaviors, Ff, 
and R.,, is the same everywhere in the region occupied by the social 
group. We shall, however, introduce the effect of limitations of trans- 
mission of information. 

It must be emphasized from the outset that all the following con- 
clusions apply only to the case of transmission of information through 
social channels of the kind discussed previously (Rashevsky, 1950b; 
1951b). Present day mass communication techniques invalidate those 
conclusions. For the study of some past social conditions these con- 
clusions may, however, be of interest. 

In preceding papers (Rashevsky, 1950b; 1951b) we have seen 
that the amount of information transmitted through a chain of in- 
dividuals tends to zero as the length of the chain increases. The ex- 
pression for that decrease is rather complicated and is given by a 
combination of exponentials. We also have considered the effect of 
the circumstance that in general there is'a very large number of 
chains between any two individuals [Rashevsky, 1950b, equation 
(62) ]. 

We shall now remark, that while the above-mentioned expres- 
sion (62) gives us the maximum possible information that can be 
transmitted between two individuals regardless of time, the actual 
information which an individual can receive from another within a 
fixed time is much less. It takes a finite time to receive any signal 
from an immediate neighbor, and the number of neighbors which an 
individual is able to contact within a definite time is limited. There- 
fore, only relatively few channels, all in the neighborhood of the 
shortest one, actually will be used. The decrease of information with 
distance will therefore be given, with a sufficient approximation, by 
expression (58) of a previous paper (Rashevsky, 1950b). 

Let us, for simplicity, assume that we may approximate the ex- 
pressions for the decrease of information transmitted through a chain 
consisting of n individuals by a simple exponential: 


R(n) =e*, (8) 


If r is the length of the chain, and a the average distance between 
two neighbors, then 
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T—an. (4) 
If 6 is the density of population, then we have approximately: 
1 
Ca rerio (5) 
Vo 


Hence, now expressing R as a function of r , we find from (38), 
(4), and (5): 


R (ays ever, (6) 


Let an individual be at the origin of polar coordinates 7,0. lhe 
amount of information which the individual receives about all indi- 
viduals located within the element of area rdrdé is 


dre*vs "drdé . (7) 


To obtain all the information which the individual receives about 
all individuals of the society, we must integrate expression (7) over 
the whole region occupied by the society. The result will depend in 
general on the size and shape of the region. Let us now find the 
amount T of information received by the individual about all indi- 
viduals within a circle of radius 7) which is much less than the linear 
dimension of the area occupied. It is given by the expression 


T= 2 a5 il re*vs "dy 
0 


1 - 1 - 
Sei 8 5, rte pee otek (8) 
ie ral kV 


From the discussion in a preceding paper (Rashevsky, 1950b) 
it follows that the constant k is most likely to be of the order of unity. 
Namely, P’ is of the order of 1 unless we have individuals strongly 
addicted to making errors, and P” << P’. Hence A <1. Therefore 


A* = e194, Hence 
k~logA <1. 


With 6 of the order of 10 ind/km2, the exponential will be very 
small for values of 7) > 2 km. Thus for circles with a radius of over 
2 km, the expression (8) will already have practically reached the 
value 22/k?, toward which it tends asymptotically as 7 tends to in- 
finity. We therefore see that for any area, the linear dimensions of 
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which are much larger than 2 km, the information received by an 
individual about all the other individuals is a constant, independent 
of either N, or 6. The information per individual equals 27/k?No, and 
decreases with N,. 

This result holds only to the extent to which R(n) may be ap- 
proximated by a simple exponential. Since it actually is a more com- 
plicated function, T in general, will depend on 6, and through it, in- 
directly on Ny. 

The following consideration can be applied to any function R(7r), 
which decreases sufficiently rapidly, so that 


fom (r) dr 


is finite. Let us use as a measure of distance between two individuals 
the number n of individuals in the straight line between them. Then 
we can define a circle of radius ), such that the contribution to T 
coming from within the circle is, say, 99% of the total T, coming 
from the whole populated area. The quantity 1, depends only on the 
psychophysiological characteristics of the individuals, and not on Np 
or on the size of the total area. In terms of the actual distance the 
radius of the circle is, according to expressions (4) and (5): 


No 


(9) 


The area of the circle is 2.2/6, and the number of individuals 
within it is mn”. Regardless of No and the size of the total populated 
area, 99% of the information which an individual has about all other 
individuals is confined to those zn,? individuals. 

Thus, regardless of the actual No, the number of individuals 
which a given individual can imitate is, with great approximation, 
mn”, and is in principle expressible in terms of the parameters A and 
B, and, therefore, in terms of P’ and P” of the previous paper (Ra- 
shevsky, 1950b). This quantity should be substituted for N, in all 
the equations of the theory of imitative behavior, whenever N, be- 
comes very large. As long as m is not expressed explicitly in terms 
of P’ and P” and its numerical value computed, it is simpler to absorb 
that constant with the constant A of equation (1). For the study of 
equilibrium configurations which are determined by the equation 


ayp—A(X—Y), (10) 
Wwe may put a, —a/A , and write 
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aqy=—x—Y, (11) 


The quantities X and Y now represent those fractions of the total 
number of individuals which exhibit behavior R, and R, respectively. 
We now have X + Y=1. 
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It has been suggested that when an organism is exposed to ionizing 
radiation the initial damage results from the occurrence of ionization in 
a so-called sensitive volume due to absorption of radiation quanta. The 
initial radiation damage is then transmitted or amplified to a level of 
macroscopic perception. In this paper a mechanism by which this trans- 
mission may take place and a finite Markov chain model applicable to 
this transmission are postulated and discussed. 

This mechanism is assumed to be the depolymerization of essential 
chain molecules which are connected to some “central group” associated 
with the sensitive volume. The depolymerization of the macromolecules 
following a hit in the sensitive volume is postulated to be determined by 
a chain mechanism, which acts in a manner inverse to the mechanism 
controlling the polymerization process. A mathematical study of this 
problem is made using the theory of Markov chains. The probability of 
complete degradation of the chain macromolecule, and the probability 
of recombination of the units to give the intact chain were determined, 
assuming that the probabilty of successive steps in the degradation in- 
crease linearly from the intact state to that of complete breakdown. 


1. Introduction. The idea of chain processes for the study of 
radio-biological phenomena has been suggested by F. Dessauer (1923) 
and developed by M. Blau and K. Altenburger (1923). The recent 
studies of I. Opatowski (1945; 1946 a, b) resulted in the development 
of a general theory of chain processes and their application to injury 
and recovery in biological systems following irradiation. In Opatow- 
ski’s theory the occurrence of an effective event in the sensitive vol- 
ume is considered as a transition of that volume to a new state. The 
observed effect is reached after a certain number of transitions be- 
tween the states have taken place, so as to reach some final state 
which represents the observed effect. 

It has been pointed out by J. Reboul (1942) that there is a dif- 
ference between the probability that a hit occurs, and the probability 
that this hit will cause a certain effect. The importance of this con- 
cept has been discussed by J. Th. van der Werff (1948). The first 
probability depends on the geometry of the target and the absorption 
of radiation quanta, while the second probability is an expression of 
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the degree of amplification or transmission of the initial hit damage 
through the system. This paper will be concerned with the second 
probability. A mathematical model, based on the theory of Markov 
chains, and a mechanism by which this transmission may take place 
will be postulated and discussed. 

2. Mechanism for the transmission of radiation damage. In or- 
der to make our mathematical model structurally meaningful we will 
first consider the proposed mechanism. Consider the sensitive volume 
of an organism to be associated with some control molecule, or group 
of molecules, that was present in the early stages of development. If 
we assume free valances on this structure, the presence of which per- 
mits the addition of other cellular constituents, we can postulate the 
polymerization of essential long-chain molecules connected to this 
group. The long-chain molecules may be nucleic acids, etc. 

The decomposition of these macromolecules following a hit in the 
sensitive volume is now postulated to be determined by a chain mech- 
anism, which acts in a manner inverse to the mechanism controlling 
the polymerization process. Such a mechanism has been considered 
by W. Chalmers (1934) in his studies on addition polymers. The ran- 
dom degradation of high polymers has been studied by many authors 
(cf. Mark and Tobolsky, 1950), and the statistics and kinetics of this 
type of degradation are well-known. We cannot, however, use a ran- 
dom model to explain transmission for it is based on the probability 
of a break occurring at any link in the polymer chain. We are inter- 
ested in the case where the initial break occurs at the end of the chain 
(i.e., the end connected to the control group), the subsequent steps in 
the degradation being the breaking off of successive monomer units 
from the damaged end. A random model might be applicable to a 
group of chain molecules with a number of control centers, and so 


spatially oriented that a hit in any one of these centers would result 
in the same effect. 


3. The mathematical model. We consider a Markov chain (Fel- 
ler, 1950) with finitely many states: 


Eo 2 Ey eg => aa ae 


The system can be considered in the state E, initially (i.e., the chain 
is intact), and following a hit in the sensitive volume passes to the 
state H,. The transition HZ, > E, can be considered as a manifestation 
of the primary damage to the system, that is, the absorption of radi- 
ation quanta in the sensitive volume. The transitions FE, > E, > ---- 
— E, will represent the transmission or amplification, and E, will 
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represent the state when the radiation damage becomes observable. 

The transition probabilities, pj.(j,k =0,1,----, a) are the con- 
ditional probabilities of the system in a state E; passing to a state 
E;,. We define our transition probabilities as follows: 


Diga — 1/0, (3 —1,---,a—1), (1) 
Degen 4j — 1: , G — 1), (2) 
Poo = Paa=1, (3) 

pix —0, otherwise. (4) 


The forward one-step transitions (1) represent the transmission, and 
the reverse one-step transitions (2) represent recovery. Transmis- 
sion of damage or recovery can occur only by passage from a state to 
its immediate neighbor. The condition that »;; = 1 when j= 0 anda 
means that no change will occur after the states H, or E, are reached; 
i.e., they are absorbing states, or, in the terminology of random-walk, 
there are absorbing barriers at 7 = 0,a. This means that no recovery 
is permitted from the state #, should the system enter that state. If 
the system never enters EF, it must eventually return to E, (i.e., com- 
plete recovery, recombination of the units of the chain molecule) since 
the probability of remaining in any intermediate state (H,,---+, Has) 
is taken to be zero. The alternative to never reaching either E, or Ey 
would be for the system to shift from one to another of the intermedi- 
ate states indefinitely. From the calculation below, it follows that 
there is zero probability for this indefinite transition between inter- 
mediate states. 

We see from our definitions that the probability of the system 
reaching FE, increases as it moves away from the intact state, while 
the probability of recovery decreases. The assumption of linear in- 
crease in the transition probabilities represents, of course, an ap- 
proximation for the sake of simplicity, made in the absence of more 
detailed information on the dependence of these probabilities on the 
extent of the degradation. Similar unsymmetric random-walk models 
have been studied by P. Ehrenfest and T. Ehrenfest (1907), M. C. 
Wang and G. E. Uhlenbeck (1945), and M. Kac (1947) in connection 
with the theory of Brownian motion. 

We arrange our transition probabilities in a matrix P = (pjx) 
where the first subscript stands for the row, the second for the col- 
umn. A matrix P is called stochastic if pj, > 0 and for every J, 


DVPn=1. 
k=0 
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We see from our definition of the transition probabilities that our 
matrix satisfies these conditions. The matrix P is given by 


1 0 0 0 0 0 0 0 0 
1a 0 a” 0 0 0 0- ~ 0.- 0 
0 Lea 2a 0 0 0 0 0 
0 0 13a 0 od 0 0 0 0 
0 0 0 14a7> 0 4a7t 0 0 0 
0 0 0 0 0 0 os a 0] oe 
0 0 0 0 0 0 ees Out 


3a. We wish to determine the probability 4, , probability of ob- 
servable damage, that the system initially in E, will eventually pass 
to E,, also the probability 4, , probability of recovery, that it will re- 
turn to E,. Now the system can arrive at E, from EF, only by first 
reaching E,., and then making the transition H,.. ~ E,. The prob- 
ability of the transition from EF, to E,.. in n steps will, as usual, be 
denoted by Di.4‘”. Then 


© 
ha = Pa-t,a > Dege™ . (5) 
n=0 


where we use the natural definition, ,..-.°°’ —0. Similarly, the prob- 
ability 2) will be given by the sum of the probabilities that in » steps 
the system arrived in the state H#,, multiplied by the probability of 
transition, FE, — H,. Hence 


Ao= Pw > Pu™, (6) 
n=0 


where, of course, 91:° =1. 

We see from our matrix P that p,, is the element in the second 
row and second column of the nth power of P, that is, P”; and p41 
is the element in the second row and (a—1)st column of P”. There- 
fore 


N 
> Pu 
n=0 


is the element in the second row and second column of J + P + P2 + 
‘+++ + P™, where J is the unit matrix. This sum does not converge for 
N > o, but we can compute the sums in (5) and (6) as follows. 

The matrix P can be written in the form of a partitioned matrix 
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a | 0 | ) 
| | 
l 
A | B | C 
| | 
| | 
0 | 0 | 1 
where 
0 at 0) 0 Ome 
1~2a-*=~0) 2a 0 0 0 
0 1-3a-1_ 0 3a"? One 
Bel SS et ee (7) 
0 0 0 0 eee 0 1-207 
0 0 0 0 Secon ee ee) 


That is, B is the reduced matrix obtained from P by eliminating the 
first and last rows and columns. Now it is clear that P” has the form 


1 0 0 


| 
A, | ite Cs 
| 


0 0 1 
where B” is the nth power of B. 
Putting 
€éu =D Pu”, (8) 
n=0 


we have e,, equal to the element in the first row and first column of 
be 
n=0 


provided this series converges. The convergence, and the limit, may 
be obtained as follows. Let 


N 
Sy = =; B* ’ (9) 
n=0 
then 
Sy — SyB= I — B™, (10) 


We show that lim B’*! = 0. The elements of B*** are pj, , for 
N>o 


j,k =1,2,----, a—1, and they are all positive. We use induction 
to show that 
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a1 

a DR) = (1 =<, Chien ee i 
j=1 

From the definition of B, 
a-1 a-1 
> P= > Da? =1— 20". 
j= g=l 

Using the induction hypothesis, 


a-1 
Spe Ss A—2a*, 
j=l 


we have 
a-1 a-1 a-1 a-1 a-1 
= PREY = SS Pip Der =D Pow D Dip 
j= jai B=1 B=1 j=l 


a-1 
= (1—- 2a)” & Pe = (1—2a*)™. 


Hence py. S (1 — 2a*)** and py, — 0 for N > ow, since 
1—2a* <1. 
Then from (10), 
lim Sy = (I— B)-, (11) 
No 


provided (J — B) is non-singular, which we now show by calculating 
its determinant. 


We have 
1 —a 0 5 : E . 
pas ees 1 —2a"* 
0 —1+3a" 1 
ie , (12), 
1 —1+2a71 
a? 1 
a —1 0 
—at+2 a —2 
Beerret 0 3 a 
e e ° a —a+2 


e ° —l a 


Let A be the determinant of this matrix, so |J — B\ =a" A. To 
compute A we add the elements in each row and place these sums in 
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the first column, then add the elements in each row excluding the 
first column and put these in the second column, ete. 
Then 


a—1 —l 0 0 0 0 
0 a—2 —2 0 0 0 
0 0 a—3 —3 0 0 
A= 0 0 0 a—4 0 0 (13) 
0 0 0 0 : 2 —at+2 
a—l1 a—1 a—|1—~a—l1 : a—1 a 
Expanding by elements of the first row 
a—2 —2 0 2 0 0) 
0 a—3 —3 ee) 0 
sa 0 0 a4 . 0 0 (14) 
Se) | : £24, eine a (G1) 
0 0 0 Bos W a—2 
a—1l a—l a—l oo) o—l |G 


Continuing to expand by elements of the first row gives 
A= (a—1)(a—2) X 


a—3 —3 ; 0 0 
0 a—4-. 0 0 aot 
5 St 8 Ya OU ene 
0 0 2 —at 1 
a—l a—l a—l a 
CEA 2 O12 
ve as a—l a 


(15) 


reomoifae(* 7 )e( So) et (SY 
=cempr [4 ( 8) a t(8TS) 8 | 


a-1 —iI1 
a (@—1)15 ( ‘ , )=2@-p1 


k=0 
The cofactor of the element in the first row and first column of A is 
equal to 


160 RADIATION DAMAGE 


A—(a—1)! 


a—l 


= (¢a—2)! (2-*—1). 


Therefore 
a—2)! (27+*—1) a 1 
peg | ae ee eee vals 
(a—1)!2- i-— 1 Ale 


We find therefore that 


1 
Ao = (1—a)ey=1— 90-1" (17) 
Similarly we obtain 
1 
Aa = . (18) 
Qa-1 


so that 4) + 44 = 1 as remarked above. 

3b. An alternate procedure, which also enables us to obtain 
the probability 4, is as follows. Let x, be the probability of being ab- 
sorbed at EH, starting from the state E,; we have then the difference 


equation 
k k ksi 
a= tan t( ees Jaa= Vk+1 
a a a 


1 k—1 I 
— — Vier = 1— Je ig here 


(19) 


a a a 


This equation is valid for 1 S k [= a — 1, and valid for all k = 1 if 
we define %..: = 1. We have also the boundary conditions 


%=—0,%=1. (20) 
Let 


f(z) =Sanet, lal eed 


Then we can rewrite (19) as 


[o') (oo) 1 [o} 
D U2 = > — aye? — — 3 aye 
k=0 k=o @ a k=o 


(21) 
1Lo\8 nok 
+( 1-= )Sace— 34 nae 


Qa / k= k=0 @ 
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and, using 
f (2) =2 kay, 2, 


we obtain 


il 
(a—1)z———-a 


f(a) + SS IO) =0. (22) 


— 2? 


The solution of (22) is readily obtained since the equation can be 
written 


BSE) oe by: 1 a—1 
f(z) Zz 1 ee 
so that 
z(1 +2)*2 a-1 a—1 (o) 
ft =o | 3 ( )a|[ Z| > (28) 
1—z k=0 k | m=0 


where C is a constant of integration. We are only interested in the 
coefficients x, , (0 = k = a). These can be seen to be 


ee 2 1. 
m=C3( . (24) 


m=0 m 


Utilizing the boundary conditions (20) to determine the constant C, 
we find that 


a—1l1 a—l hel 
en Ge a OO 
0 1 a—1l 

so that 


at BY Moreh | 
n=2( )/ 2,0 2kSa. (25) 


m 


If we let k=1 in (25), we find that 


a—1 1 
— 20-1 = * (26) 
Ha ( 0 \/ Q0-1 


in 1 e ° 
since * —1. Hence z, is equal to 4, determined previously. 
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4. Concluding remarks. In this paper we have discussed the 
transmission of radiation damage under the assumption that a hit in 
the sensitive volume of an organism is followed by a chain depoly- 
merization of some essential macromolecules. We also assumed that 
these macromolecules have their origin at some structure which is 
present in the early stages of cell development. The results obtained, 
(17) and (18), show that for large values of a recovery is almost 
certain. In terms of biological systems this means that the prob- 
ability of an observed effect and the probability of recovery are re- 
lated to the extent of polymerization of the macromolecules, when 
radiation quanta are absorbed in the sensitive volume; the probability 
of an observed effect being greater during the initial stages of the 
polymerization process. From studies on the sensitivity of organisms 
to irradiation (Sax and Swanson, 1941) it has been shown that sen- 
sitivity changes with age. The model postulated here is based on 
the assumption that the sensitive volume is a dynamic cellular entity ; 
therefore we would expect its properties to be intimately related to 
the metabolic and physicochemical activities of the cell. 

It has been pointed out that many organisms are highly sensi- 
tive to radiation during the metaphase of the mitotic process. Since 
aster formation occurs during the metaphase we might speculate on 
a relationship between the site of aster formation and the sensi- 
tive volume. The possible influence of X-ray irradiation on aster for- 
mation has, apparently, not been investigated. In terms of our model 
the centrosome might be considered as the control group, and the 
asters the chain molecules that undergo depolymerization. 

This work was aided in part by a grant from the Dr. Wallace C. 
and Clara A. Abbott Memorial Fund of The University of Chicago. 
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A “generic” problem amenable to matrix algebraic treatment is 
outlined. Several examples are given and one, a communication system, 
is studied in some detail. 

A typical structure matrix is used to describe the channels of com- 
munication and a “status” matrix is used to describe the distribution of 
information in the system at any time. 

A theorem is proved relating the status matrix at any time t to the 
tth power of the structure matrix. 

The elements of the communication system are interpreted as indi- 
viduals who can send messages to each other. For the individuals at- 
tempting to solve a “group problem” certain relations are derived be- 
tween the structure and status matrices and time of solution. 

The structure of the communication system is permitted to vary 
with time. A general theorem is proved relating the status matrix to the 
matrix product of the series of structure matrices representing the 
changing structure of the system. 

Some suggestions are made for further generalizations. In particu- 
lar, it is suggested that so-called “higher order” information transmis- 
sion can be similarly treated. 


Introduction. A variety of problems which have been (or could 
be) studied by the use of matrix representations have some interest- 
ing properties in common. The kinds of problems which we refer to 
are exemplified by: the mathematics of radiation and cosmic ray 
counters, the theory of neutron production and absorption, mathe- 
matical genetics, experimental design, the mathematics of peck right, 
epidemiology, the ontogeny of neural nets, stereochemistry, commu- 
nication systems, and a host of others. 

This wide range of problems (or significant aspects of them) 
may be regarded as special cases of the following “generic” problem: 

A certain number (n) of essentially “equivalent” objects are 
the elements of a system. Associated with every ordered pair of ele- 
ments (which may be regarded as distinguishable) is the affirmation 
or negation of k relations. We may symbolize the affirmation of the 
pth relation between the ith and jth elements by the expression 
iR,j . This, of course, need not imply that jF,i, that is, the relation 
R, need not be symmetric. 
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The negation of the pth relation between the ith and jth elements 
may be written as ik," . 

In addition to the relations associated with the ordered pairs of 
the system each element may be associated with one or more of m 
intrinsic properties. Thus the affirmation (of possession) of intrinsic 
property q by element 7 can be symbolized by the expression 7P, , and 
its negation by iP,7. 

The relations and intrinsic properties of the system may, In gen- 
eral, imply a dynamics which is reflected by a quantized temporal 
shift of relational and intrinsic property affirmations to negations 
or vice versa. 

Matrix Representation. For any given problem of this kind our 
attention may be focused upon the “structures” implied by the rela- 
tions between the elements or upon the dynamics of the system. In 
either case matrix representation can be used to advantage. Such a 
system can be described by matrices in the following way: 

First, number the elements of the system from one to n in some 
arbitrary fashion. Assume for illustration that only one type 
of relation will be considered. In other words, for each ordered pair 
of elements 7,7 we will say either 7Rj or iR-4j, i.e., 7 is related to 7, 
or 7 is not related to7. 

The set of all such relations can be conveniently described by an 
n X n matrix whose elements are either R or R*. Thus if a typical 
element of the matrix e;; is R , then we will say iRj . If, however, the 
typical matrix element e;; is R“+, then we will say 1R-47. Such a 
matrix can be called a structwre matrix insofar as the relations be- 
tween the elements imply a structure of the system. The matrix 
representation can be extended to include any number, say k, of 
different kinds of relations. In this case it would be necessary to use 
an nk X n matrix or kn X n matrices to represent the totality of re- 
lationships. 

In a similar way the distribution of intrinsic properties among 
the elements of the system can also be represented by a matrix. If 
we consider a system possessing m different classes of intrinsic prop- 
erties, then an » X m matrix will do the job. Such a matrix can be 
called a status matrix insofar as the distribution of properties among 
the elements of the system describes a status of the elements as com- 
pared to each other. 


An Example. For the sake of illustration, consider the following 
problem. How many structurally different compounds are represent- 
ed by the formula C,H, (only saturated hydrocarbons considered) ? 
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If we picture such a compound in a conventional steric diagram, 
it becomes evident that every carbon atom in the molecule (except 
for the special case of methane) is chemically bound to one or more 
other carbon atoms. In the most general case such bonds may be 
single, double, or triple, but for saturated compounds only single 
bonds occur. 

Now, if we let the integer 1 represent the relation “has a single 
bond with” and if we let 0 represent its negation, that is, “is not 
bound to,” then, by arbitrarily numbering the carbon atoms from 1 
to uw we can write a series of propositions of the form i1j and p0q. 
In words, these expressions mean “carbon atom i has a single bond 
with carbon atom 7” and “carbon atom p is not bound to carbon atom 
qe” 

According to the procedure outlined above, the totality of rela- 
tions among the carbon atoms can be represented by a uw X wu struc- 
ture matrix. 

For the trivial case of CH, (methane) we have a1 X 1 matrix 
whose only element is 0. In other words, the only carbon atom in 
the molecule has no bonds with itself. Ethane (C.H,) is represented 
by the 2 X 2 structure matrix 


0 1 
( 12-0 ) 

The elements of the major diagonal of such a structure matrix 
will always be 0 since the statement “atom 7 has a single bond with 
atom 7” is meaningless. The relation “has a single bond with” as used 
here is always reciprocal, so it follows that the structure matrix will 
always be symmetric around the major diagonal. Also note that the 
sum of the elements of the structure matrix is exactly twice the num- 
ber of carbon atoms used in forming the “backbone” of the molecule. 
It follows then that 


[yi lpzp yo Cij—. (1) 
Gat ERE 

The class of all pure saturated hydrocarbons having the formula 
C,H, can, therefore, be represented by a class of uw X mu matrices. It 
must be noted, however, that the total number of isomers of the form 
C,H, is considerably smaller than the number of different u X u mat- 
rices which can be used to represent them. This is due to the fact 
that the numbering of the carbon atoms is arbitrary and that renum- 
bering does not change the structure. It is further complicated by 

the fact that renumbering also may leave the matrix unchanged. 
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For these reasons many matrices are equivalent in the sense 
that they represent the same substance. The original problem, there- 
fore, of finding the number of different substances represented by 
the formula C,H, reduces to the problem of counting the number of 
subclasses of equivalent matrices making up the class of uw X mu mat- 
rices representing the formula Cy . 

The fact is that this particular problem has been attacked and 
partially solved by number and group theoretical methods (Polya, 
1936). These results are, of course, translatable into the language of 
matrices and, therefore, should have a direct bearing on all of the 
other phenomena subsumed by the aforementioned “generic” prob- 
lem. 


Other Examples. In the above example no status matrix is in- 
volved and the attention is focused on the structure of the system. 
However, aS was mentioned in the introduction, the existence of a 
structure matrix and status matrix describing the same system fre- 
quently implies a dynamics. Accordingly, the dynamics of a neural 
net conforming to the postulates of McCulloch and Pitts has been 
described and analyzed by the use of matrix algebra (Landahl and 
' Runge, 1946). 

In this case the elements of the system are neurons. The rela- 
tions appearing in the structure matrix are “can stimulate” and “can 
inhibit.” The intrinsic properties associated with the status vector 
are “is active” and “is not active.” In this paper it is shown that 
consecutive products of the status vector with the structure matrix 
give consecutive status vectors which represent the “state” of the 
net for different moments. 


In a subsequent paper H. D. Landahl (1947) extends the re- 
sults of the first paper and treats the “reverse problem,” i.e., instead 
of describing the behavior of a given net he starts out with a given 
dynamics and defines the kind of net which will exhibit such be- 
havior. 


As a final example A. Rapoport (1949) and H. G. Landau (1950) 
use matrix theory in the study of the dynamics of chicken “societies” 
organized by the relation of “peck right.” 

The foregoing is not intended as a proof for the wide applicabil- 
ity of matrix representation, for this is well known. The point is that 
in the cases cited above the structure matrix represents, in one way 
or another, a connected system, a network, an organization, or “lace- 
work” of simple relations. The structure matrix can, therefore, be 
interpreted as an important invariant of what may be loosely termed 
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“group order.” An. extensive study of square matrices, therefore, 
with an eye to the structures they imply may lead to useful defini- 
tions of such vague concepts as organization, group stability, and 
SO on. 

Some important steps in this direction appear in recent litera- 
ture (Luce and Perry, 1949; Luce, 1950). The authors define an 
organizational concept which they call a clique and the more general 
notion of an n-clique. Roughly speaking, a clique is a certain or- 
dering of relations in a subgroup of a system. R. D. Luce and A. D. 
Perry show certain relations between the existence of cliques and 
certain properties of the matrices representing the system. 

In addition, these authors apply matrix algebra to a simple 
“communication system” in a manner quite analogous to the Lan- 
dahl-Runge treatment of neural nets. They arrive at a recursion 
formula for consecutive states of the system which is also analogous 
to the formuia of Landahl and Runge. 

Group Communication. The Problem. The following problem, 
although in a way more general than that approached by Landahl 
and Runge, Luce and Perry, is nevertheless treated in a similar man- 
ner. The statement of the problem and some of the definitions may, 
therefore, be expected to overlap somewhat. 

Suppose that a group of 7 persons are asked to play the follow- 
ing game. Each member is told at the outset that he must send ‘‘mes- 
sages” to certain other specified members of the group. On the other 
hand, he is not told who the recipients of the other members’ “mes- 
sages” are. In fact, at the beginning of the game he does not even 
know who, if anyone, will send messages to him. The point of the 
game is for each member to know the complete structure of their 
communication system. In other words, each member must know the 
destination of every other member’s messages. 

Accordingly, the messages themselves must contain such infor- 
mation. A typical message will read “The destination of C’s mes- 
sages is D , the destinations of B’s messages are A and D, and so on.” 
The expression “the destinations of B’s messages are A and D” is ac- 
tually a statement of all the information which member B possessed 
at the beginning of the game. Accordingly we shall refer to such ex- 
pressions as primary elements of information. 

Now since there are » members in the group, there must also 
be n primary elements of information. The game will end, there- 
fore, when every member possesses ” different primary elements of 


information. 
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The Method. If we number the members of the group from 1 to 
n in some arbitrary fashion we can proceed in the usual manner to 
construct a structure matrix which will represent the communication 
system. The matrix can be constructed by rows. To write down the 
jth row of the matrix we ask “Does member j send messages to 
member 7”? If the answer is yes, then the element e;; of the struc- 
ture matrix is one. If the answer is no, then the element e;; is zero. 
By repeating this procedure n? times the complete structure matrix 
can be written down. 

For example, see the three member group diagrammed below 
in Figure 1 and a corresponding structure matrix as follows: 


lees On eee 
fhe knoe (2) 


FIGURE 1 


Notice that the diagonal elements of the structure matrix are 
all 1’s. This would be as though each member of the group sends 
himself messages. This notation is useful in the operations which 
follow. We can think of it as though each number repeatedly ‘“‘re- 
minds himself of” or “remembers” the information which he has 
already obtained. 

A status matrix describing the distribution of primary informa- 
tion throughout the group can also be constructed. This can be done 
columnwise. To write down the jth element of the 7th column we 
must ask: “Does member 7 possess the primary element of informa- 
tion associated with member 7”? If the answer is yes, then the jth 
element of the ith column is one. If the answer is no, then the jth 
member of the ith column is zero. This process defines the status 
matrix. 

At the beginning of the game each member of the group will 
possess only the primary information associated with himself. In 
other words, all that he will know is where he sends his own mes- 
sages. The status matrix at the beginning of the game will, there- 
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fore, be the identity matrix. The matrix will have 1’s along the ma- 
jor diagonal and zeros everywhere else. 

Successive States. As the game proceeds the information will 
begin to shuttle from person to person according to the pathways of 
the system. This will result in a constantly changing status matrix. 
Now, if the rate of flow of messages were in no way controlled, cer- 
tain differences between the various people playing the game would 
begin to show themselves. Some members of the group may find 
themselves, for example, sending only one-third as many messages 
per unit time as other members. In order to remove such differences 
between the individuals from our considerations we will stipulate that 
each member must send messages along all of his permissible chan- 
nels exactly once per unit time. The problem is now well defined and 
we can ask the first question, namely, “How long will the game last”? 

Before going on to answer this question it should be noted that 
a slight change in the definition of the status matrix makes it pos- 
sible for us to prove a useful theorem. To this end we will say that 
if the general element of the status matrix e;; is some positive in- 
teger, then member i possesses the primary element of information 
associated with member 7. However, as before, if e;; is zero, then 
we shall say that the member 7 is ignorant of the primary element 
of information associated with member 7. 


Fundamental Theorem. The status matrix, representing the dis- 
tribution of primary elements of information among the members of 
the system after t units of time, is given by the tth power of the 
structure matrix in the sense of ordinary matrix multiplication. 

Proof: Lemma. If S denotes the structure matrix and /; de- 
notes the status matrix after ¢ units of time then, 


f= 15S. (3) 


Let e;;" denote the typical element of the status matrix after ¢ units 
of time. Also, let s;; denote the typical element of the structure ma- 
trix. According to the definition of ordinary matrix multiplication, 
we have 


Dis = Sixers + 8320p) +++ + Sin€nji; (4) 


where ;; is the typical element of the matrix product LS. 

Since none of the elements si, or ¢; is ever negative, it fol- 
lows that neither the right-hand member of expression (4) nor any 
‘of its terms s;x@;“ can ever be negative. This implies that if the 
right-hand member of expression (4) is zero then all of its terms 
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Sixexj* are individually zero. However, Sixexj‘) = 0 means either 
Six = 0, ej; = 0, or both. But, according to the definition of the 
structure matrix, si, = 0 tells us that member 7 never receives mes- 
sages from member k. From the definition of the status matrix 
ex; = tells us that member k (at the time ¢) does not possess the 
primary element of information associated with member j. If either 
(or both) of these conditions hold, it follows that member 1 could 
not obtain the primary element of information associated with mem- 
ber 7 at the time t + 1 from member k. If this is true for all of the 
terms in the right-hand member of expression (4), then it follows 
that member i could not obtain the primary element of information 
associated with member 7 at the time t + 1 from any other member. 
We must conclude then that, given these conditions, the general ele- 
ment of the status matrix at the time ¢ + 1 must be zero, i.e., 
e;;*) =0. But, given these conditions, we see from expression (4) 
that pi; is zero. Hence, for this case we find that pi; = e:;‘*?. 

The only remaining possibility is for the right-hand member of 
expression (4) to be a positive integer. This means that s;¢,; (for 
at least one &) is a positive integer. In order for this to be true we 
must have sj, = 1 and e;) = + integer. But s;, — 1 means that 
member 7 does receive messages from member k. Also, ej; = + 
integer means that member k (at the time t) does possess the pri- 
mary element of information associated with member 7. From this 
we must conclude that member 7, at the time ¢ + 1, will possess 
the primary element of information associated with member 7. This 
implies that e;;"* must be a positive integer. But under these con- 
ditions pi; is a positive integer, as a glance at expression (4) will 
show. Hence again e;;‘*”) = »;;. Thus the lemma is proved. In 
other words, expression (3) is a valid recursion formula. 

Now let us recall that the status matrix I, (at the beginning of 
the game) is simply the identity matrix. Using expression (3), we 
get 1, = 1S. But I,S = S; hence I, = S*. It follows by induction 
that the fundamental theorem is true, that is, 


I,=S'*. (5) 


Expression (5) makes it possible for us to give at least a for- 
mal solution to the original problem, namely: “How long will the game 
last” ? 

If any element of the status matrix I, is zero, then we know that 
at least one member of the group does not yet possess the primary 
information associated with at least one other member, and so the 
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game is not finished. The game will end at the first moment when 
every element of the status matrix is non-zero, The game will end, 
therefore after T units of time, where T is the smallest power of the 
structure matrix containing no zero elements. 


Adequate Systems. Not all groups will be able to end their game. 
Their channels of communication may be inadequate. Such inade- 
quacy may be due to an insufficient number of channels or an inade- 
quate arrangement of those which are available. We shall refer to 
the structure matrix of such an inadequate communication system 
as an inadequate matrix. Similarly, the structure matrix of an in- 
adequate communication system can be called an adequate matrix. 
Some interesting details relating to these notions of adequate and in- 
adequate matrices are developed by Luce (1950) in the section on 
connectivity in a structure. 


In order for a structure to be adequate, it is necessary for every 
member to send messages to at least one other member and to re- 
ceive messages from at least one other member. In fact, there must 
exist at least one path of channels from any given member of the 
group to every other member. If this were not so, that is, if no such 
path existed between member 7 and member 7, for example, then the 
primary element of information associated with member i would 
never be available to member 7. In terms of the structure matrix 
this means that there must exist some sequence of integers k,, k., 
ks, +--+ Km (not necessarily all different) such that six,, S:,%,, Skx, 
-«-- $5 are all 1’s. 

The Cycle. Since every member must send messages to at least 
one other member, it follows that at least n channels are necessary 
in order to construct an adequate system. In fact, this number is 
sufficient if the channels are laid out in a simple closed chain which 
includes all of the members of the group. We shall refer to the ma- 
trix of such a structure as a minimal-cyclic matrix. 

Changing Structure. The game can be complicated somewhat 
by permitting the members to shift their channels of communication 
in some prescribed temporal order. The structure matrix, under these 
conditions, would change from time to time. 

In order to describe such a system it would be necessary to write 
down a series of structure matrices S,, S.,-::: , S:,++-:, where S; 
represents the particular communication structure being used at the 
time t. We shall refer to such a series of matrices as a structure 


series. 
The General Theorem. The status matrix, representing the dis- 
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tribution of primary information among the members of the system 
after t units of time, is given by the matrix product S,S.8; ---- St, 
in the sense of ordinary matrix multiplication. 

Proof: In proving recursion formula (3) we assumed that the 
structure matrix S was constant with time. But this assumption in 
no way affects the argument, since it deals only with the tth exchange 
of information. This permits us to write the slightly more general 
recursion formula: 


lin lpr (6) 


But since J, = I,S, = S,, the general theorem follows by a simple 
induction on expression (6). 

Adequate Structure Series. We have already shown that certain 
trivial structure series (constant adequate structure) results in a 
finite game. We may now ask for more general conditions on the 
structure series such that it will be adequate. 

Theorem. The maximum number of different structure matrices 
representing possible communication systems between n labeled mem- 
bers is given by the expression 2-2, 

Proof: The 1’s appearing in the ith row of a structure matrix 
tell us (by definition) to whom member 7 sends his messages. It may 
be that he sends no messages to anyone, in which case the only 1 ap- 
pearing in the ith row would be element s;;. But this can happen in 
only one way. It may be that member 7 does send messages to ex- 
actly one other member, in which case a 1 would appear elsewhere 
in the row. This could happen in n — 1 different ways. In general, 
member i may send messages to k other members. This could 


happen in( a ee =) different ways, where ( be 3 ‘) is the number of 


ways in which we can choose k objects from n — 1 objects, all dif- 
ferent. The number of ways N in which a given row can be “dis- 
posed of” is given by the expression 


(eae n—1 is . tly 2. 
2 )+( 3 ar (astee (7) 


It follows that the number of ways in which all of the rows can be 
disposed of together is N” = 2("-»), which proves the theorem. 
Definition: Any structure series which indefinitely repeats a sub- 
series of structure matrices will be called a cyclic structure series. 
Theorem. Any cyclic structure series which contains one or more 
adequate matrices is an adequate structure series. 


De stm 
iG oe 


N—=1+(n— 1), 


Suhyay 
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Proof: If a positive integer appears in some element of the 
status matrix at some time ¢t, then, by virtue of the fact that each 
member “reminds himself of” his already acquired information 
(every diagonal element of any structure matrix is 1), that particu- 
lar element of the status matrix can never again be zero, for any 
time after ¢. It follows that if a given adequate matrix is applied 
to the changing status matrix every k& units of time, then every ele- 
ment of the status matrix will be a positive integer in at most kT 
units of time, where T is the minimum power of the adequate struc- 
ture matrix containing no zeros. Such a cyclic structure series must, 
therefore, be adequate. 

Since the game always ends at the moment when zeros no longer 
appear in the status matrix, it follows that the game will last 7 units 
of time, where T is the minimum subscript of the structure matrix 
for which the product S$,S,S, ---- S- contains no zeros. That this 
product contains no zeros for some T is, in fact, the definition of an 
adequate structure series. 

It is possible for every matrix of a structure series to be inade- 
quate and still form an adequate structure series. For example, con- 
sider the structure series S$,S.S,S. ---- , where 


1 0 i 8 
Sa and S.— 
1 1 0 a 


It is readily seen that the element e., of structure matrix S, and 
element ¢,. of structure matrix S, will remain zeros for all powers 
of these matrices. This means that both S, and S, are inadequate 
matrices. The product S,S., however, contains no zeros; hence, the 
structure series is adequate. 


Efficiency. For a group of any given size certain structure mat- 
rices or structure series end the game faster than others. The solu- 
tion time T is, therefore, a kind of measure of the efficiency of the 
communication system. The various adequate structure matrices and 
series can, therefore, be ordered in a preferential hierarchy. 

If every member of a group could communicate directly with 
every other member of the group, only one interchange of informa- 
tion would end the game. However, such a communication system 
would require n(n — 1) channels. On the other hand, a minimal- 
cyclic structure has a solution time of T= —1, but it has only n 
channels. For some purposes we may be interested in the efficiency 
of a communication system as reflected by its economical use of chan- 
nels. The number C of channels in a given structure is simply the 
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sum of the elements of the structure matrix minus the number of 
elements in the major diagonal; thus we have 


C=S33s,—2. (8) 


Now let us return to the structure matrix (2) of the three member 
group example. The status matrices associated with this structure 
are as follows: 


eK Pay a 12 1=-2 
f= 100. 10206 | Ste 0) ee eee (9) 
Guam ‘ip ae at jr A as 


The solution time of this structure is 2 units, since J, is the first 
status matrix containing no zeros. Notice, however, that three of 
the elements of J, are 2’s. This means that each member of the group 
received one primary element of information on two different oc- 
casions. 

For some purposes such a repetition of information may be con- 
sidered an advantage insofar as information received many times 
may perhaps be more likely accurate. 

We may, however, be more interested in an economy of mes- 
sages. This may be particularly true when the system is such that 
each message is very likely to be accurate. In this case it would be 
ideal if the solution matrix contained only 1’s. 

From the above considerations it would seem that the ratio M 
of the number of elements in the solution matrix (n*?) to the sum of 
the elements of that matrix could serve as a useful measure of the 
system’s efficiency in regard to message economy; thus 

n?2 
Wee 


- (10) 
2 ej 
i,j=1 
It should be noted, however, that the sum of the elements of the 
solution matrix does not give the total number of messages sent. This 
sum is usually larger than the number of actual messages. To inter- 
pret the meaning of the operations between the structure and status 
matrices accurately, we must say that a given member who has re- 
ceived a primary element of information k times by the time ¢ will 
repeat that element of information k times in all of the messages he 
sends at the time t + 1. Actually, this simply implies that the sum 
of the elements of the solution matrix reflects a “weighting” of the 
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evils of redundance. It is as though this sum were saying: “Tt is bad 
to send the same information three times, but it is more than twice 
as bad to send it six times.” 

So far we have three possible measures of efficiency: the inverse 
of the solution time 7-", the number of channels per member C/n, 
and the redundancy ratio M . 

Minimal-Cyclic Structure. In a minimal-cyclic structure the most 
remote member from any given member is n — 1 steps away. This 
means that the solution time T of a minimal-cyclic structure is n — 1 
units. The efficiency of such a system (in regard to speed) drops 
linearly with increasing size of the group. 

This structure has exactly 1 channel per member so that in this 
sense it is perfectly efficient. 

By elementary considerations we find that the redundancy ra- 
tio M of a minimal-cyclic structure is given by the expression 


M = n/2", (11) 


This means that the redundancy rises sharply with increasing size 
of the group. 

Theorem. The redundancy ratio M of any series structure 
S,S.8,; ---- which consists entirely of minimal-cyclic matrices is al- 
ways less than 1. 

Proof: In order for the redundancy ratio ™ to be 1 it is neces- 
sary that all of the elements of the solution matrix also be 1. The 
determinant of such a matrix, however, is zero. This means that the 
ideal solution matrix is singular. 

It can be readily shown that a minimal-cyclic matrix is never 
singular. But the solution matrix is given by a product of the form 
S,S>---- Sp in which all of the factors are non-singular matrices. The 
theorem follows from the fact that the product of a non-singular ma- 
trix with a non-singular matrix is also non-singular. 

Partial Solution. An interesting variation of this game would be 
to stop it after a designated playing time. We would undoubtedly 
find that some structures had progressed further toward solution 
than others (as measured by the number of zeros in the final status 
matrix). 

It may be that under certain circumstances a communication 
system of this kind is in constant danger of being completely blocked. 
It may also be that certain structures which are highly efficient ac- 
cording to the previously mentioned standards actually transmit most 
of the information just near solution time. 
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Such a structure would not be very useful in such “dangerous” 
circumstances, since its disruption before solution time would leave 
most of the members ignorant of most of the information. 

Other structures not quite as efficient according to the previous 
standards may exhibit a rapidly growing spread of information in 
the system. If it were disrupted before solution time at least much 
of the information would have already been transmitted. 

Also in this connection, if we permit negative numbers in the 
status matrices to represent false information and in the structure 
matrices to represent “chronic” liars, we would have situations anal- 
ogous to the spreading and checking of false “rumors.”’ Perhaps sim- 
ple matrix operations can be devised to represent these phenomena. 

Higher Order Information. So far the players of the game were 
all interested in only one question, namely, “What is the structure 
of the system”? In a higher order game the participants might be in- 
terested not only in what they know about the structure of the sys- 
tem, but also in what everyone else knows about the system. 

The distribution of information in such a system can be repre- 
sented by a matrix whose elements are also matrices. A discussion 
of such higher order games will appear in a subsequent paper. 

This investigation is part of the work done under Contract No. 
AF 19(122)-161 between the U. S. Air Force Cambridge Research 
Laboratories and The University of Chicago. 
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The output of a neuron innervated by two other neurons which, 
in turn, are subjected to two independent Poisson showers of stimuli, is 
derived as a function of the frequencies of the Poisson showers under 
two distinct assumptions, 1) where either of the two neurons can fire 
the third, and 2) where the stimuli from both neurons must impinge 
within a certain time interval to fire the third. For very small frequen- 
cies, the output of the third neuron is very nearly the sum of the input 
frequencies in the first case and proportional to the product of the in- 
put frequencies in the second case. Hence the designation “addition” 
and “multiplication” theorems. This treatment is a generalization of a 
previous treatment where the Poisson shower was assumed identical for 
the two outer neurons. 


References to I, II, III, and IV will apply to a series of papers 
entitled ‘‘Contributions to the Probabilistic Theory of Neural Nets” 
(Rapoport, 1950a, 1950b, 1950c, 1950d). In III and IV we consid- 
ered the output of a neuron N; subjected to stimuli from two other 
neurons, N, and N.2, which, in turn, were subjected to a single Pois- 
son shower of stimuli. We now wish to generalize the problem by 
supposing that N, and Nz receive separate independent Poisson show- 
ers of frequencies x and y respectively. 

CASE 1. “ADDITION” OF FREQUENCIES. We shall first assume that 
a stimulus from either N, or Nz can fire N; provided it does not fall 
within the refractory period of N;, which, as in earlier papers, we 
designate by 6. 

If the refractory period of all three neurons were zero, the out- 
put of N; would be simply x + y, the sum of the outside inputs. If 
the refractory period of N; only would be zero and that of N, and Nz 
were 6, then the outputs of N, and Nz would be «/(6 x + 1) and 
y/(6 y + 1) respectively (cf. I), but the output of N; would still be 
the sum of the outputs of N, and N.. If, however, all three neurons 
have refractory period 6, the situation is more complicated. This is 
because the outputs of N, and N. are not Poisson-distributed and 
hence cannot be treated in the same way as the original “outside” 


input. 
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The Frequency Distribution of the Time of the First Firing. As 
in a previous paper (III), we wish to determine the probability fre- 
quency distribution of the time of the first firing of N;, where the 
origin of time is chosen 6 units following a firing. This distribution 
will in this more general case be different depending on whether that 
last firing was due to a stimulus from N, or from N,. Let us suppose 
it was due to N,. Then the probability frequency will be given by 
the following equation, which is a generalization of equation (6) 
OLE LV: 


t 


t 
yd y 
pi (6,t) =xe**) 4 — : i eleva 
ef 6 Ye Ore tL of 
0 ad 0 


et (1) 
em [x+y + vy(6—t)], for tS6; 
dyt+1 
5 oe 
d exp (y 6 — d 
(0,1) =20*"| 1 ay | a ge 
L dyt+1 dyt+1 
9 5 
t 5 
cae EDU eae) : : 
dbyt1 aa ve T (2) 
0 ~ 
__ («+ y)exp(y 6 — yt — at) 


= torit> 6% 
CP sie Al 


It can be readily verified that for « = y, equations (1) and (2) 
reduce to equations (8) and (9) of IV, as, of course, should be the 
case. 

' If we choose an initial firing due to N., then our frequency dis- 
tribution will evidently be of the same form as (1) and (2) but with 
x and y interchanged, namely, 


evi 
)2(6, t) = at+yt+ay(o—t <6; 
2 Prats: y + xy ( )], fort=d; (3) 
(x + y) exp(x 6 — at — yt) 
P2(d, t) = —————_______-, fort> 6. (4) 
oe +1 


From these frequency distributions, the expected time of the first 
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firing can be computed. For the distribution p,, it is 


(oa) 5 


tene 
tp, (6, t)dt= [x + y+ xy(d6—t)] dt 
fi 1 {| eal y ( )J 


t(x + y) exp(y 6 — yt — zt) 
ie yt 
F oyt+1 


(5) 


a: “(it dy) —y?(1—e*%— 62) 
a(a+y) (oy +1) 3 


For p.(6, ¢) we need only interchange x and y in (5) and obtain 


(oa) 


2 1+ 6 Sy) — p-yd —__ 
| ee eS (6) 
‘ y? («+ y) (6x2 +1) 


As a check on the computations we note that both (5) and (6) 
reduce to (12) of IV for « = y, as should be the case. 

Relative Frequencies of Firing due to N, and N.. Equations (5) 
and (6) give the expected times of the next firing of N; following 
firings due to N, and Nz respectively. To get the average expected 
time of the next firing, we must know how to weight the two known 
expected intervals, i.e., we must know the relative frequencies with 
which a firing of N; is due to N, and to Nz respectively. 

Selecting a firing of N; at random, let gi be the probability that 
it is due to a stimulus from N, and q, that it is due to a stimulus from 
Ne (di + G2 = 1). Moreover, let g:2 be the probability that a firing 
due to N, follows one due to N,, and analogously define qis , qi, and 
G22. Then we have 

Q1 = 1911 T G2Qo1 
(7) 
Qe = Qi912 + GeQ22 - 

Equations (7) state that a firing is due to N; if 1) either the 
preceding firing was due to N, and a firing due to N, followed it, or 
2) the preceding firing was due to Nz and a firing due to N, fol- 
lowed it; similarly for a firing due to N.. The multiplications of the 
q’s are justified, since the q’s are actually conditional probabilities, 
e.g., “given that a firing is due to Ni, qi2 is the probability that the 
next firing is due to N. .” 
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We now proceed to compute the qi; (i = 1, 2;7 =1, 2). We 
note that the first terms on the right sides of (1) and (2) give the 
probability frequency distribution of the time of the following oc- 
currence: “The first stimulus following an instant 6 units after a 
firing due to N, comes from N,.” Therefore, if we integrate the sum 
of these terms from zero to infinity, we shall obtain the probability 
that it occurs at all, i.e., the probability that the first impulse after 
6 units after a firing due to N, comes again from N,. But this is qu. 
Therefore 


6 t 
adr 
Ube == | ze (1— A ) dt 
: ~oyttl 
(8) 
oa) 6 t 
d 6— 
| 2en( 1 yds ronyo—¥ a, ) at. 
, ’ oyt+l ; 6yt+1 
Similarly, 
6 t 
Qi2 = | ie (1-" xe" dr )) ee 
: Oy +1 i 
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The expressions for q2: and @2. are obtained in an analogous 
manner. Performing the indicated operations we obtain 


e+ éy) +¥(e*+32—1) 


qh ' 10 
: x(a + y) (dy +1) oo 
Qu =1— Gu, (11) 
_V(l+6%) + 2(e% + dy—1) 
lan ict <n pili Se le eae a (12) 
y(x+y)(d6%+1) 
Gor =1— don. | (13) 


Equations (7) can be readily solved for G:/Q2 , namely, ~ 
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Q:/G2 = CL G23) / (1033). (14) 


Substituting the values of qi: and gs. from equations (10) and 
(18) into (14), we have 
Q x[u?(1—e) + xy] (dy +1) 
a ylyt(l—e*) +ay](de+ 1)" 
Finally, since g, + gz =1, we obtain 
= x fa?(1 — €49-F xy] (6 y + 1) 
oo x [x? (1—e>) +xy] (6y+1) +y[y? (1—e~*) +r] (6a+1) fe 
yly?(1—e*) + xy] (6% + 1) 


de = (17) 
xx? (1—e) +ary] (6y+1) +y[y? (1—e**) +ay] (62+1) 


(15) 


The average time between firings will then be obtained by the 
weighted sum (where the weighting factors are q, and q2) of the right 
sides of (5) and (6) increased by 6, namely, 


Exot [ tp.(3,t)dt +a. f tp, (3, t)dt, (18) 
it) 0 


which, after elementary but laborious computation, gives 

bo ye (le) + ay) (x7 + 6 y) yet — bz) ] 
Fitba (l —¢*>) + ry) [y? (1 + 6x) (19) 
EOS RP ae ee 


where 
Ae ay (oy) (ie ee) a? y)] yt 3) 
(20 
+ [y3(1—e*) + wy?](6x+1)}. 
The frequency of N; is then 
Wi ACA by (ae (1 — |e) 
+ avy][a2(1+é6y) —y(l—e*®— 62)] 
(21) 


+ [y2(l—e**) + ay] [y(1 +62) 
=e le) Oy te 


For x = y, equation (21) reduces to (14) of IV. For very small 
values of 6 (or, which is the same thing, for very small values of x 
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and y), it is seen that x + y is a first approximation. Taking some- 
what larger values of 6 (or of x and y), it can be verified that 
2(da +1)2+ y(6 y + 1) is a second approximation, which would 
be exact if a refractory period were assumed for N, and Nz but 
neglected for N;,. For large values of 6 (or for x and y) equation 
(21) reduces to 6“, the maximal frequency of N;. 

A. Shimbel (1949) used the following approximation for the 
output of N; : fs =f: + fe —6 fife, where fi, fe, and f, denote the 
respective outputs of N,, N., and N;.. In our notation this would 
read 


1/i=2(67 +441) t4+ yy wd) oayh Ga Dovel 


This approximation agrees closely with formula (21) for both very 
small and very large values of x andy. 


CASE 2. MULTIPLICATION OF FREQUENCIES. We may consider the 
“multiplication” of the inputs of N, and Nz in its simplest form by 
neglecting the refractory period. If successive stimuli from a single 
neuron can sum to fire N; , the case reduces to that of a single neuron 
subject to a Poisson shower of stimuli. This case was treated in II. 
We shall suppose next that two stimuli one from each shower are 
required to fire N;. Then, if x and y are the respective frequencies 
of the showers, a stimulus from a given shower is effective if, and 
only if, it is preceded within o units (where o is the period of latent 
addition) by a stimulus received from the other shower. Thus the 
probability that a stimulus from N, is effective will be 1 — e¥’, and 
the probability that a stimulus from N, is effective will be 1 — e*°. 
The frequency of effective stimuli will then be 


x(1—e7) + y(1—e*), (22) 


which for small values of x and y reduces to 2xyo. 

If, on the other hand, x and y are large, the frequency approach- 
es x+y and increases with x and y without bound. We wish to com- 
pute the frequency of N; where 6 is the refractory period of all three 
neurons and thus provides an upper bound (6-*) to their frequencies. 

For simplicity of treatment it will first be assumed that x = y. 
The generalization to the case of independent inputs x and y will 
then become apparent, and, moreover, the special case of equal inputs 
will provide a running check on the laborious computations. We shall 
also assume that « < 6/2. This condition insures that no three con- 


secutive stimuli impinging upon N, may fall within an interval of ¢ 
units. 
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We define as a “potentially effective” pair of stimuli a pair which 
fall within o units of each other. The condition ¢ < 6/2 implies not 
only that the stimuli of such a pair come from different neurons, but 
also that the potentially effective pairs are “isolated” from each other, 
i.e., a stimulus can belong to at most one such pair. 

Consider first the frequency of potentially effective pairs. It 
was shown in III that if an arbitrary instant is chosen on the time 
axis of, say, N, without knowledge of the time of its last firing, then 
the probability frequency distribution of its next (or last) firing is 
constant for an interval of length 6. That is, if the arbitrarily chosen 
instant is taken as the origin, the frequency distribution for the in- 
terval (0, 6) will be given by «(6% + 1)-. Therefore if a stimulus 
from N, is chosen, the probability that it is preceded by another stim- 
ulus within o units (which necessarily must come from N. since 
o <0) isxo(6x2 +1). Since the frequencies of N, and Nz are both 
x(d«% + 1) (cf. I), the total frequency of stimuli received by N; is 
evidently 2x(6a + 1). Therefore the frequency of those stimuli 
which are preceded within o by other stimuli (the frequency of po- 
tentially effective pairs) is 2%’0(6 x + 1). Not all of these pairs, 
however, can be effective in firing N;, since one or the other of a 
potentially effective pair may fall within the refractory period of N3;. 
Our task is now to compute the fraction of such potentially effective 
stimuli which are not actually effective. 


swt § t +0 
= to tf %t, 


FIGURE 1 


In Figure 1, 0 indicates the origin of time and also a firing in- 
stant of N,;. The neuron N; has fired at t = 0, because N, and Nz 
have fired within the interval 7 < oc, neither of the stimuli falling 
within the refractory period 6 of N3. 

Consider now the conditions under which the next pair of stim- 
uli falling within o units (a potentially effective pair) will be actually 
ineffective. This can happen only if one or both of the stimuli from 
N, and N, fall within the refractory period of N; which lasts until 
6 (see Fig. 1). But the stimulus from N, cannot fall within this 
interval since N, is itself refractory for 6 units following its firing at 
0. Thus a necessary condition for the ineffectiveness of the next po- 
tentially effective pair is that the stimulus from N, fall somewhere 
at t, <6. Because Nz is refractory from — r to 6 —7, tz falls in the 
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interval (6 — 7,6). 

Now the necessary and sufficient condition that the next poten- 
tially effective pair be ineffective is that in addition to the stimulus 
from N, falling at t., the stimulus from N, should fall at t,, such 
that t; — t < o. Otherwise the stimulus at t, will not be a member 
of a potentially effective pair, Ns will not fire in the interval 
(0, t + «), and the next potentially effective pair will fall wholly 
outside of 6, ie., it will be actually effective. Hence to compute the 
probability that the next potentially effective pair after t = 0 be in- 
effective is equivalent to computing the probability that the stimuli 
from N, and N, fall at t, and ¢, as shown in Figure 1. 

This probability, in view of the theory developed in III, is given 
by 


1 o 6 
fai a; { x“ exp(x6—2 t,—@ 7) dt, 
Co 0 6-T 
(23) 
toto 
. i x exp (2 é6— x t,) dt. 
6 


The limits of the last integral indicate that ¢, must fall beyond 
6, but it must fall within o of t, if there is to be a potentially effec- 
tive pair. The limits of the middle integral indicate that t, must fall 
beyond 6 — 7, but it must fall before 6 for the pair to be actually in- 
effective. The first integral indicates that + can range with equal 
probability from 0 to o for the effectiveness of the initial pair at —r 
and 0. The computation of f is straightforward and yields 


SS sere oer 
i BY carta aan (24) 
220 

The quantity f represents the fraction of actually ineffective 

pairs following actually effective pairs. But the only ineffective pairs 

are those following effective pairs, as shown by the argument above. 

Hence f represents the fraction of ineffective pairs for every effective 

pair. Therefore the fraction of all the pairs which are effective must 
be 


2a. , 
Po se Nagtef a ceae e (25) 
4g —= 3 Ae ee —— aad 


and this is the correction factor which must modify the frequency of 
potentially effective pairs to give us the actual frequency of N3;. 
The frequency of N; is thus seen to be 
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2x70 ta Axe? 

Ls SO: (26) 

(Oa+1)? (6%+1)?[4a0—8 + 4e7e — 27] 
For very small x , we have the approximation 2x20 , which indicates 
that one of every potentially effective pair of stimuli fires N 7b or 
very large x, equation (26) reduces to o/6?, and, in particular, if o 
takes its maximum permissible value, 6/2, the frequency will be 1/26. 
which is intuitively evident. 

To generalize (26) to the case in which the inputs impinging 
upon N, and Nz have independent frequencies x and y , we must com- 
pute f, and f., the respective probabilities that a potentially effec- 
tive pair following a firing due to N, or such a pair following a firing 
due to Nz is ineffective. The probability f, is given by a generaliza- 
tion of equation (23), namely, 


1 20 26 
A= | az | y exp(yd6—yt,— y 7) dt, 
o 0 6 


-T 


(27) 
te+o 
. { x exp(x6— 2x t,) dt,. 
6 
This yields 
ry) aed 6 1 le * aye (1?) (28) 


me xyo(“+y) 
It can be easily seen that f. is obtained from f, by interchanging x 
and y, namely, 
y(2--y) (tot €°—1) —x? (I—e*") +axye"* (1—e"7) 


2— 


(29) 
xyo (x+y) 


On the other hand, the frequency of potentially effective pairs is 
symmetric in x and y and is given by 2vyo[(6 # + 1) (Oy + 1)]”. 
Therefore for every two firings there are f,; + fs ineffective pairs, 
and the correction factor turns out to be 


2 


= ———— (30) 
fitfet2 


f 


Combining these results, we obtain our “multiplication theorem,” 
ie., the output of N; subject to stimuli from Ni and N. which must 


“sum” to produce a firing, namely, 
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27 y?ot(x t+ y)[(62 +1) (6y4+1)]“ 
X [x2 (2Qyo + e¥? —1) + y? (2x0 + €*7 —1) (31) 


+ wy (ee +.e%7 — e-@wr.— 1) J. 


For « = y, (31) reduces to (26). For small and for large values of x 
and y, approximations analogous to those in the previous case are ob- 


tained. 
This work was aided by a grant from the Dr. Wallace C. and 
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Simple chemical reactions of Ca++ and K+ ions with a P- ion in a 
membrane are assumed to be causes of changes in permeability of that 
membrane to K+ ions. On the basis of such a mechanism a quanta- 
tive concept of membrane permeability to K+ ions is defined. The dif- 
fusion of K+ ions through such a membrane is studied mathematically 
in a simplified version. An applied electrical field as well as the diffu- 
sion potential of the K+ ions are considered to effect the chemical equi- 
librium constants of the proposed reactions. It is shown that such a 
system, which can be described by a set of nonlinear differential equa- 
tions, may have two stable states of equilibrium which are separated by 
an unstable equilibrium state. As a consequence, such a system may 
possess a threshold. Estimations of resting potential, threshold—electri- 
cal as well as chemical—and of permeability increase, together with that 
of the corresponding electrical field strength are shown to have the cor- 
rect order of magnitude. A possible way to derive the one-factor theory 
from a physical mechanism as considered here is outlined. It is pointed 
out that the dependence of the thresholds and the permeability changes 
on several parameters might be calculated on the basis of such a mech- 
anism. As an example it is shown how in principle some of these rela- 
tions are derived. Furthermore, the time course of excitatory disturb- 
ance for different intensities of the initial disturbance are derived theo- 
retically for the case of chemical stimulation. The curves so obtained 
show a striking similarity in all the characteristic features with the cor- 
responding ones which are obtained experimentally for the case of elec- 
trical stimulation of nerve. These results suggest that response to elec- 
trical and chemical stimulation is based on a common threshold phenom- 
enon such as considered here. Finally, a more detailed mathematical de- 
scription, which takes into account explicitly the diffusion of K+ ions 
through the membrane for a finite thickness of the membrane is out- 
lined. The equations obtained, which seem to be infeasible of solution 
at the present time, suggest that it is plausible that relaxation oscil- 
lations with a threshold can be derived on the basis of such a mech- 
anism as proposed here. Qualitative agreement with experimental evi- ~ 


dence is indicated. 


The concept of threshold is a typical bio-mathematical concept 
and plays a basic role in electrophysiology (see, e.g., Schaefer, 1940) 
and the mathematical biology of the (central) nervous system (Ra- 
shevsky, 1948). Some suggestions as to the quantum-physical nature 
of this concept were made by A. Bethe (1937) and P. Jordan (1944). 
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An attempt to derive the all-or-none law on a probabilistic basis was 
made by H. von Schelling (1944). A formal attempt in connection 
with subthreshold potentials based on an electrical model of a mem- 
brane-element, the resistance of which is assumed to be a nonlinear 
function of the potential difference across the membrane, was made 
by B. Katz (1947). Another mechanism which exhibits threshold 
effects was derived by A. Rapoport (1950) from a probabilistic treat- 
ment of neural nets. 

I. Tasaki and T. Takeuchi (1941) obtained an action curve from 
one node. As an estimate of the length of one node we might take 
(cf., e.g., Engstrém and Liitky, 1950) as high a value as 3 X 10° cm, 
while the liminal length is of the order of 10°* cm (Rushton, 1937). 
From these estimates we may infer that propagation along one node 
is not very likely. The result mentioned above of Tasaki and Takeu- 
chi would then be an example of local excitation without propagation, 
which still gives an action curve. Furthermore, according to Tasaki 
(1940), the all-or-none law holds, in general, for excitation of a sin- 
gle node. 

One of the postulates of the formal theory of W.A.H. Rushton 
(1937) underlying much recent experimental work (e.g., Katz, 1937; 
1939) corresponds to a local threshold. It is, in fact, assumed in this 
theory that the local condition of the membrane suddenly changes 
after the resting charge of the membrane has been altered by a criti- 
cal amount 6,. From equation (8) of Rushton’s paper (loc. cit.), it 
follows that the liminal length is greater than zero only if the criti- 
cal value 6, is greater than zero. The latter would correspond to a 
local threshold. 


In the formal theory of Katz (loc. cit.) a threshold mechanism 
is discussed for one membrane-element only. This would, therefore, 
also correspond to a local threshold. 

A. M. Weinberg (1942) has pointed out that the only thing which 
can be concluded from the experimental curves describing the sub- 
liminal responses of nerve (Katz, 1937, and others mentioned in Wein- 
berg’s paper) is that the subsidence of the excitatory state « is not 
described by the usual linear differential equations of the one- or two- 
factor theories, but should be described by a nonlinear differential 
equation, in which the time-constant is itself a function of «. Fur- 
thermore, he has pointed out that an interpretation of the deviations 
from linearity in terms of a “local response” in accordance with Rush- 
ton’s notion of liminal length (loc. cit.; Katz, 1939) is somewhat irrel- 
evant from the strictly phenomenological viewpoint, a physical in- 
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terpretation of such a nonlinearity being outside the scope of a for- 
mal theory. A local threshold is postulated also in the two-factor 
theory of nerve conduction (Rashevsky, 1948). 

In the light of what has been mentioned above, one might believe 
that it is plausible that a local threshold exists. It is also conceivable 
that a threshold for the longitudinal gradient of the “polarization” 
exists (cf. Hichler, 1933; also Katz, 1937, 1939). And it may very 
well be that those thresholds, if they both exist, are not independent 
of each other; it is conceivable that the latter threshold might be 
derived from the former one. We realize that the phenomenon of 
excitation and propagation is probably described by one, or several, 
(very difficult) partial differential equations or integrodifferential 
equations. It would then seem to be artificial to concentrate too much 
on either the “local” or the “longitudinal” aspect of the problem. How- 
ever, these exact equations are still unknown at the present time. 
Therefore we will focus attention here on the local aspect of the pro- 
cess. After what has been said above in relation to work of Tasaki 
and Takeuchi, Tasaki, Rushton, Katz, and Weinberg, we feel justi- 
fied in doing this. We would like to remark here, however, that it 
is conceivable that even if the local response process is essentially 
a threshold phenomenon, it may appear to be graded (for a differ- 
ent suggestion, see Katz, 1937). This may be due to a combination 
of several factors. Namely, the stimulation may be “parametric,” by 
which we mean that as a result of the stimulation the conditions are 
changed with, as a consequence, a change in threshold of the response 
process. These changes and their consequences may very well be 
graded. Or, due to experimental techniques, the threshold of local 
stimulation may be masked either by stimulating too large an area 
(e.g., by electrical spread or leakage) or by large gradients at the 
borders of the stimulated area. In this relation the method used by 
Tasaki, Tasaki and Takeuchi in the papers mentioned above might be 
preferable. As a matter of fact, as stated above, they obtained an 
action current in this way from an excited region much shorter than 
the liminal length, while Tasaki concluded that the all-or-none law 
holds for such a process. 

The importance of relaxation oscillations in relation to the re- 
sponse of tissues to stimuli has been realized for some time. B. van 
der Pol and J. M. van der Mark (1928) have compared the rhythmic 
heartbeat with a relaxation oscillation and have pointed out several 
similarities between the two (e.g., heart block and demultiplication of 
frequency). A. Fessard (1931) also emphasized the similarity be- 
tween relaxation oscillations and neural rhythms. Later work of Fes- 
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sard (1936) and A. Arvanitaki (1938a, b; 1939) also points in the 
direction of the occurrence of relaxation oscillations in the case of 
stimulated nerves and myocardium. At this point it might also be of 
some interest to mention the model experiments of Bethe (1943), O. 
Schifer (1943), Bethe and H. Schaefer (1947). D. Auger (1936) 
found many similarities in this respect between the response of ani- 
mal and plant cells to stimulation. N. Rashevsky’s suggestion for de- 
riving a threshold for relaxation oscillations was elaborated by the 
author (Karreman, 1949), who was unaware that the theory of these 
so-called “hard oscillation” cases had already been developed to a 
certain extent previously (Andronow and Chaikin, 1937; 1949). 
The problem has been treated recently also by N. Minorsky (1947). 
These “hard oscillation” cases, with threshold properties, occur, 
mathematically speaking, if there are at least two limit cycles around 
a stable singular point, the first one being unstable, the second one 
stable (Fig. 1). 


x 


FIGURE 1. Phase plane representing a “hard oscillation.” S is a stable singu- 
lar point, 1 and 2 are limit cycles which are unstable and stable respectively. 


The theory of self-excited oscillations has been, to a large ex- 
tent, based on ideas and work of H. Poincaré (1892). Analytical 
treatments have also been given by, e.g., van der Pol (1926) and N. 
Kryloff and N. Bogoliuboff (1937; 1949). It has had many impor- 
tant applications in the case of oscillations of mechanical and elec- 
trical systems. In the case of excited muscle and nerve fibers transi- 
tion through an unstable state or region has been observed (Rush- 
ton, 1982; cf. also R. S. Lillie, 1923). Some theoretical approaches 
in this direction have been made by K. F. Bonhoeffer et al. (1948a 
through f), together with experimental investigations of the activa- 
tion of a passive iron wire as a model for the excitation of nerve. R. 
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S. Lillie had made similar investigations previously (1923, 1924) to 
get a model for the transmission of nerve impulses. Weinberg (1942) 
has pointed out the analogy with nonlinear electrical and mechanical 
systems by constructing a “local phenomenological characteristic” 
from experimental data (Katz, 1937) on subliminal responses of 
nerve. In Weinberg’s paper several other investigators are quoted 
who have suggested that the excitation equations are nonlinear. 


According to the Membrane Theory proposed by J. Bernstein 
(1902) the resting potential across the membrane of a nerve or mus- 
cle fiber is ascribed to the selective permeability to potassium ions 
which are present inside the fiber in a concentration which is higher 
than the normal concentration outside it. K. S. Cole and H. J. Curtis 
(1938) showed that during activity of Nitella the transverse resist- 
ance drops temporarily, suggesting a structural breakdown of the 
membrane and, as a result, a temporary loss of selective permeability 
to ions. They (loc. cit.; 1939) have also been able to show a close 
relation between the change of the membrane electromotive force and 
that of the conductance of the membrane. They consider the sudden 
changes in those quantities to be connected with the activity which 
is responsible for the all-or-none law and the initiation and propaga- 
tion of the nerve impulse. Their results for the squid giant axon cor- 
respond to those previously found for Nitella (Cole and Curtis, 
1938). There are also suggestions (cf. Schaefer, 1940) that potas- 
sium ions effect the structure of the membrane. Furthermore, A. L. 
Hodgkin (1947) has found that the conductance of the membrane is 
increased about three-fold when the external potassium concentration 
is trebled. In addition, there is some evidence that diffusion of potas- 
sium ions does play a role in the process of the development of the 
action potential of giant nerve fibers (Webb and Young, 1940) and 
that of the giant plant cell Nitella (Osterhout, 1934; Hill and Oster- 
hout, 1934). The latter investigators consider, as a good approxima- 
tion in the case of Nitella, the diffusion of potassium ions only.* 
Assuming with Cole and Curtis (1940) that the membrane conduct- 
ance is a measure of its permeability to ions we see that potassium 
ions cause an increase of that permeability, presumably (Schaefer, 
1940) by causing a so-called “loosening-up” of the membrane; while 
calcium ions, in general, decrease the ion permeability of the mem- 
brane (cf. R. S. Lillie, 1923; Schaefer, 1940). The increase in perme- 


igati Aki hich, un- 

*Very recent investigations (Hodgkin, Huxley, and Katz, 1949), Ww ; 
ee came to our attention after the major part of this work had pee 
done, indicate that the increase in the permeability to sodium precedes that of the 
permeability to potassium in the case of the squid giant nerve fiber during action. 
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ability at the cathode and the decrease in permeability at the anode 
might be due (cf. Schaefer, 1940) to this effect of K and Ca if we 
keep in mind that the concentration of K in the inside of the fiber is 
greater (and relatively more than that of Ca) than in the outside 
(Webb and Young, loc. cit; von Muralt, 1945) and both ions travel 
to the cathode from the anode. The importance of structural changes 
of surface membranes in relation to stimulation had already been 
stressed by R. S. Lillie (1923). It has been suggested (Heilbrunn, 
1943) that during stimulation Ca ions are released from a compound 
in the cell surface by, e.g., potassium ions. Heilbrunn considers as 
essential for stimulation a colloid chemical reaction produced by cal- 
cium, similar to blood clotting. He and his coworkers have given evi- 
dence to that effect in the case of stimulation of lower organisms (for 
example, Amoeba proteus). In our opinion this scheme is not very 
likely to account for fast reactions such as occur, e.g., during the 
stimulation of nerve fibers, since the duration of the action poten- 
tial is only of the order of milliseconds. We will, therefore, consider 
a membrane with only one or more layers composed of a calcium com- 
pound and will investigate here the diffusion of potassium ions 
through such a membrane, assuming that the latter are able to react 
with the calcium compound. According to R. Héber (1945) excita- 
tion is a reversible increase of permeability to ions. True enough, 
Ho6ber considers, with many others, the latter brought about by a 
transitory dispersion of the colloidal structure of the plasma mem- 
brane. These colloidal changes are still hard to treat theoretically, 
though recent advances in this field (cf. Verwey and Overbeek, 1948) 
might be very helpful. 


FIGURE 2. Model of a membrane which changes its permeability in accor- 
dance with the chemical reaction (1). The numerals I and II represent the inner 
and outer boundaries of the membrane. 


Model of a membrane. We will assume that a compound of Ca, 
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CaP, , in which P may represent a fatty acid ion or a lipoid constitu- 
ent, forms a layer in the membrane. The latter may be mono- or 
bimolecular (Hoéber, 1945; Danielli and Davson, 1935) as shown in 
Figure 2. Potassium ions are supposed to react with this compound 
(Heilbrunn, 1943) in the following way: 


CaP, + 2K* = Ca* + 2KP. (1) 


The KP formed by this reaction might be said to produce a 
“hole” (cf. Héber, 1945; Fleckenstein, 1942), as shown in Figure 2. 
This concept of hole is, of course, much too simple and much too 
crude. Instead, the interactions of the Ca*, P- and K+ ions should be 
taken into account in much more detail. Yet it is interesting to see 
what consequences, particularly quantitative ones, can be drawn 
from it. 

Instead of using the highly oversimplified reaction (1) we will 
investigate the more appropriate scheme: 


CaP, = CaP* + P-, (2) 
CaP+ = Cat + P-, (3) 
p- + K+= KP, * (4) 


which corresponds to Figure 3. 


Figure 3. Improved model of a membrane which changes its permeability 
in accordance with the chemical reactions (2), (8), and (4). 


We will call the equilibrium constants of these reactions K, , K., 
and K," respectively. We assume here that we may apply the laws 
of chemical equilibrium and use the concept of concentration even 
though we may be dealing with pauci-molecular layers in the mem- 
brane. Then we have: 
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(CaP2) 


K,=—__—_,, (5) 
(CaP*) (P>) 
oe Cle, (6) 
(Ca) (P-) 
Ke ES (7) 
(KP) 


where parenthesis denote molar concentrations. 
Denoting by P, the total concentration of P- we find: 


2( CaP.) (CaP) se. CK Pye es) nae (8) 


We regard the P, as determined by metabolic reactions. 

Elimination of (CaP.), (CaP*) and (KP) from (5), (6), (7) 
and (8) gives: 

2K,K,(Ca**) (P-)? + {Ke (Ca**) 
9 
+ K;,(K*) +1}(P-) —P,=0. oy 
This equation determines (P-). The two roots of (9) have different 
signs as can be easily seen. Of course, only the positive root of this 
equation for (P-) has a physical meaning. 

The first basic assumption we will make is as follows. The per- 
meability h of the membrane to potassium ions is proportional to the 
number of “holes.”’ The latter number we assume to be proportional 
to (KP) + (P-). Of course, more appropriate (better weighted) 
combinations of (KP), (P-), and (CaP*) might have been chosen. 
However, a serious difficulty, due to lack of sufficient available data, 
is to decide at present which combination or function of these con- 
centrations should be taken as a measure of the permeability of the 
membrane to potassium ions. The different fields of force between 
the ions should be taken into account, as well as the different masses 
and, therefore, different moments of inertia, the latter being impor- 
tant for the rotation of the molecules involved. 

On the basis of these assumptions we find: 


ah= (KP) + (P-), (10) 


a being a constant of proportionality. 
Through these assumptions, essentially described by equations 
(2), (3), (4), (8), and (10), the above-mentioned antagonistic in- 


fluences of Ca*+ and K* ions on the permeability of membranes is in- 
troduced. 
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Elimination of (KP) from (7) and (10) gives: 


ah=K;(K*) (P-) + (P-). (11) 
Solving (11) for (P-) we find: 
ah 
UP RSet (12) 
Ke(K*) +1 


Introducing (12) into (9) and rearranging leads us to: 
2K,K.(Ca**) a? K.(Ca**) + K,;(K*) +1 
———___——— fh? + anh Po==(; (13) 
th CRT es: Ke eal 


This quadratic equation in h determines h as a function of (K+) and 
ofe(Ca7). 
We now introduce the following notations: 


(K+) =c, 
2K,K.(Ca*)a@—=a, 


(14) 
K,; = b 5) 


K,(Ca*)a=d. 


We then can rewrite (13) for h =h(c) as follows: 


h(c) +ah(c) —P,=0. (15) 


a d 
——— /hA(c)$ + 
(be--1)# | | bee 


It is easy to see that the function h(c) determined by (15) is 


Lot 
Py 


OC) 


ho 


O C 
Ficure 4. Illustration of the relation between the permeability to potassium 
ions and the concentration of those ions in the membrane on the basis of a 
mechanism defined by equations (2), (3), (4), (8); and (10). 
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P. ; 
such that h(0) > 0, while h(c) approaches etre approaches in- 
a 
finity (which is, of course, beyond the physiological limits). Similarly, 


h= wild for (Cat) = 0, implying a= d= 0. Furthermore, it can be 

a 
shown [most easily by differentiation of both sides of (15) with 
respect to ¢ and using the fact that physiologically h(c) > 0] that 
h'(c) > 0 and h”(c) < 0 for all positive c [and h(c)]. Therefore, 
plotting h(c) against c we obtain a curve like the one shown in Fig- 
ure 4. 

For later use we will only mention here the dependence of the 
slope of the curve shown in Figure 4 on the concentration c. It is 
given by: 

{2ah(c) + df (c)} bh(c) 
h'(¢) = 


~ {2ah(e) + df(e) +af2(e)} fe)’ 
f(c) being an abbreviation for 
f(c)=1-4 hc. 


(16) 


The diffusion of potassium through the proposed membrane. We 
will now consider the diffusion of potassium through a membrane of 
the type outlined in Figure 3. We suppose, as is the case with sev- 
eral living membranes (Hoéber, 1945), that at one side of the mem- 
brane the potassium has the concentration K, and at the other side 
K,# K,. As is mentioned above, the approximation which considers 
the diffusion of potassium alone through one or more membranes is 
the one actually considered by S. E. Hill and W. J. V. Osterhout in a 


K,+ x I a 


ZERO-LEVEL OF 
ba arg and rien ner chart ek =~ Kt-1ON CONC. 
| MEMBRANE ! e 
FIGURE 5 
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long series of papers on the action potentials of the giant plant cell 
Nitella (e.g., Osterhout, 1934; Hill and Osterhout, 1934; Osterhout 
and Hill, 1934; 1938). These action potentials have some features in 
common with those of nerve and muscle fibers.* They also share com- 
mon features with those of the heart. According to Webb and Young 
(1940) the diffusion of potassium ions appears to be a principal fac- 
tor responsible for the development of the action potential. In nerve 
and muscle fibers, as well as in Nitella cells, the inside concentration 
k, of potassium ions is much greater than their outside concentra- 
tion K, (Hober, 1945). Therefore, we will consider the situation 
shown in Figure 5. 

At both sides of the membrane there are two layers. The first 
layer, of thickness L, , is the one in which the diffusion of potassium 
ions from the high potassium ion concentration K, to that (x) just 
inside the membrane occurs. The other layer, of thickness Ly , is the 
one in which the diffusion of potassium ions from the concentration 
(z) just outside the membrane to the low outside concentration Kz 
takes place. We will consider the average concentrations in these 
layers 


1 OG oer z+ Ke 
an 


respectively, use the approximation method developed by Rashevsky 
(1948) and take for the permeabilities of the inside layer I the 
value h(a); for that of the outside layer II the value h(y). Assum- 
ing the chemical reactions to be very fast compared with the diffu- 
sion processes, so that they may be considered always to be in equi- 
librium, we may suppose h(x) and h(y) to be determined by the same 
type of mechanism as described by equations (2), (3), and (4). There- 
fore, h(x) and h(y) themselves are determined by an equation of 
the type (15) with c = # and c= y, respectively. Using the approxi- 
mation method we now find the equations describing the conserva- 
tion of K+ ions in both diffusion layers and in the membrane: 


Ra at 
d 


2 ie eae 
AL, See AD, 
dt L, 


—Ah(x)(%—y), (17) 


*Very recently important differences have been pointed out (Hodgkin, 1949). 
+Where we speak of permeability in the future we mean the permeability 
to potassium ions. 
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Aj = Ane) (a—y) —Ah(z) (y—2), (18) 


Bains 


2, z—K. 
AL, ——— = Ah(z) (y—z) — AD : (19) 
dt L 


2 


A being the cross-section. 

Equations (17), (18), and (19) form a set of three simultane- 
ous, nonlinear differential equations. Though the mathematical treat- 
ment of this problem in its most simple approximated form leads to 
this difficult set of differential equations, it is not too difficult to see 
that the situation might physically correspond to an oscillatory phe- 
nomenon occurring after a threshold has been exceeded. Indeed, by 
raising the external concentration z in Figure 5 the permeability of 
II is increased since the function h(z) is a monotone increasing func- 
tion of z. As a consequence, the diffusion of potassium ions through 
layer II is increased. Therefore, y decreases and z increases initially. 
With decreasing y the difference between x and y increases so that 
the diffusion through I increases, lagging behind the increase of dif- 
fusion through II. The increased diffusion through I reduces 2. 
Therefore, the difference between K, and 2 is increased. In this model 
K, and K, are supposed to be constant (the metabolic process keep- 
ing K, constantly higher than K, is not considered here). The in- 
creased difference K, — x increases the diffusion in the inside layer 
which tends to restore the original value of «. At the same time the 
increase of z increases the diffusion in the outside layer, which de- 
creases z again. Assuming the initial increase of z is large enough 
(corresponding to a threshold) and the time rates of the various dif- 
fusion processes are of the right order of magnitude, we see that it 
might be possible for the original situation to be restored after prac- 
tically the whole content of the membrane of potassium ions has been 
released. 


It has been found (Hodgkin and Huxley, 1947) that the charge 
carried by the potassium ions released in one impulse is about twice 
the charge on the resting membrane. If one assumes that the latter 
charge is located in a layer with a thickness of comparable order to 
that of the membrane, then the above observation is compatible with 
the mechanism under consideration. 

Instead of treating here the set of equations (17), (18), and 
(19), we will consider first a simplified special case of it, which al- 
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ready shows some interesting features. We assume that the perme- 
ability of the membrane is determined by a similar type of mecha- 
nism. We again consider a CaP, layer as an essential element of the 
membrane. This compound of CaP, is in equilibrium with two com- 
plex ions CaP* and P~ according to: 


CaP, = CaP*++P-, (20) 

CaP* = Ca* + P-. (21) 
The equilibrium constants are, as before, 
(CaP) 


— — (22) 
(CaP*) (P-) 
and 
CaP+ 
(Ca*) (P-) 


We will now consider also some electrical effects. An applied 
cathode (Fig. 6) will shift the equilibrium of reactions (20) and (21) 
toward the right-hand side, if we assume that the Ca* ions are bound 
to relatively immobile P- ions. This effect is due to the positive charge 
of the CaP* and Ca** ions. 


KETTE ETE 
CO) 


Fiqure 6. Membrane M in the neighborhood of an applied cathode C illus- 
trating the effect of the attraction of the cathode on the Ca*+ ions. 


We may consider this effect as being due to the attraction of the 
Ca* ions by the negative electrode. This effect might be expressed 


by the equation: 
Kivy= Kio 2 es (24) 


in which 
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V = the electric potential difference across the CaP, layer. 


K;,y = the equilibrium constant of the ith reaction (20) or (21) 
(i = 1, 2) when the electric potential difference across the CaPs 
layer is V. Similarly for Kio. 


e = the charge of a monovalent ion. 
k =the Boltzmann constant. 
T =the absolute temperature. 


f =a correction factor which expresses the fact that the effec- 
tive charge of the Ca** ion might be less than 2 and the effective po- 
tential difference across the CaP, layer might be less than the meas- 
ured potential difference. Therefore, f may describe the potential 
drop inside the membrane and across another layer. We will estimate 
the value of f later and expect it to be less than 1. 

We suppose that the electric potential difference V is composed 
of the applied one (4 V) and the diffusion potential of potassium. 
The latter has been assumed here because Osterhout (1949) has shown 
that it is a good approximation for the resting potential in the case of 
Nitella (cf. also Fleckenstein, 1942). We then find: 


RT Cy 
V =—log.— + (AV), (25) 

F is 
c, and ¢ representing the concentrations of K* ions at both sides of 
the layer. Furthermore, it is assumed that the mobility of K+ ions 
through the membrane is so much higher than that of any anions (cf. 
Osterhout, 1949) that the diffusion of the latter does not need to be 
taken into account in the calculation of the diffusion potential. Intro- 
ducing (25) into (24) leads to: 


: ZERO-LEVEL 
OF KtION CONC. 


< ———— »s/< — - -- > 


L 


MEMBRANE 


FIGURE 7. Schematic illustration of the concentration levels on both sides of 
a membrane of negligible thickness. 
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C, oF ~ f (AV) 

Kiy = Kio ( = é : (26) 
Ce 

As we see, the influence of the electric effects makes the reactions (20) 

and (21) of higher order. 

To simplify matters in another respect, we will treat here first 
the case of a membrane whose thickness 4 may be neglected. Then we 
have the situation shown in Figure 7. 

The thickness of the diffusion layers on both sides of the mem- 
brane will be assumed to be the same as well as the diffusion constants 
of potassium in them. It will be shown, in a more general treatment 
below (cf. the last part of this paper), that these assumptions do not 
change anything significantly. 

In the same way as above, the diffusion in the two layers next to 
the membrane with thickness L is found to be described by the fol- 
lowing equations: 


Cheer? 
d 
a K,—2 
AL Se /00, —Ah(xz—y), (27) 
dt 
y+ K, 
d 
— k, 


2 y 
AL ——— = Ah(x—y) — AD (28) 


dt 


In these equations the permeability h is considered to be a func- 
tion of the potassium concentration inside the membrane. It will, 


0 
+ 
dries IONS | eux OFKTIONS | FLUX OF KTIONS IN OUTSIDE 
——P 
INMEMBRANE | Fux of NatiONS IN OUTSIDE 
a 
FLUX OF ANIONS IN INSIDE | FLUX OF ANIONS | FLUX OF ANIONS IN OUTSIDE 
ie wee i —_—m_ a ee = = en a= a= = 
IN MEMBRANE Se 
FLUX OF OTHER CATIONS 
IN INSIDE 
INSIDE OUTSIDE CATHODE 


MEMBRANE 


FIGURE 8 
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therefore, in general, be a function of and y. However, because 
is relatively high, the variations of x are relatively small. In addi- 
tion, under our assumption that the membrane is permeable only to 
potassium ions, more of the latter will arrive at the external side of 
the membrane than leave there. This is because of the continuity of 
the electric current which inside the membrane is carried by potas- 
sium ions and by anions. We have so far neglected the latter for rea- 
sons of mathematical simplicity. In the outside layer, however, it is 
carried for the most part by sodium ions and anions (Fig. 8). 


Therefore, the accumulation of K* ions will take place at the ex- 
ternal side II of the membrane. This is another reason why the per- 
meability is determined to a large extent by y in Figure 7. 

However it will be shown later on how the more general case, in 
which the permeability is also dependent on x , can be treated approxi- 
mately. 

Introducing h = h(y) into the equations (27) and (28) and 
simplifying the latter equations, we get: 


i Sas fg ae yaa (29) 
dt L? L 
dy 2h(y) 2) 
7g day ee og Maas aes (30) 


Because h is considered here to be a function of one of the dependent 
variables, this set of differential equations is also nonlinear. A set 
of this kind might be considered as a first extension of the usual equa- 
tions describing diffusion phenomena (including permeability fea- 
tures) with constant diffusion and permeability coefficients, as has 
been done in most cases up to the present time (Rashevsky, 1948; see 
also for discussion of some nonlinear diffusion cases). 


Addition of the two equations (29) and (30) gives 


d(x+y) 2D (K,4+K 2D 
di I? 1 2) "Te (2 +3). (31) 


The solution of (31) is: 


passe ; 2D 
ie 2D 
oe y—e* sey), liteok 2D UK, + K,)dt 
L? 
0 


or 
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Airy 
ety=(K,+ KK.) te” {(%+y).— (Ki+K2)}, (32): 


(x + Y)> being the initial value of (x + y). 
Hence in the steady state (for t > «) we have: 


ie = Koes 5 (33) 


as could have been obtained also directly from (31). 
Solving for x from (33), we get: 


x= (Ky+ K.) —y. (34) 
Subtraction of equation (30) from (29) gives: 
d(x—y) 2D 2D A4h(y) 
—_——— = — (K,— K,) —— (*x—y) — x—yY). 35 
a =; (t=) (@—y). (85) 


_ To calculate h(y) we will first make the very crude assumption 
that f = 1in (26). For c, and ¢ in (26) we take x and y respectively 
as in Figure 7. Then we find from (26): 


g \* Fay 
Kiv = Kio ( a e*T 4 (36) 
7] 
Since 
7] e< Ky, ? 
and hence 
y<<K,+K., 


therefore we may write: 
mh ee (37) 


Hence in this approximation, « may be taken as constant, as could have 
been concluded from our above considerations in connection with the 


relatively insignificant variations of x. 
We now introduce the following notations: 


K,+K,=24, (38) 


Kenko eae (39) 
Then we find from (36), together with (37) and (38), 


2€ 
Kiy=4Kin ety. (40) 
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From (22) and (23), in which we now take K;,v instead of K;, we 
obtain: 


(CaP.) = Ki,y(CaP*) (P-), (41) 
(CaP*) = K.,y(Ca**) (P-). (42) 

Introducing (42) into (41) we find 
(CaP.) = K,,yKo,y (Ca**) (P-)?. (43) 


We now have the following expression for the reaction system (20) 
and (21), which is similar to equation (8): 


2(CaP.) + (CaP*) + (P-) =P. (44) 


Introducing (40) into (41) and (48), and then the latter into 
(44) we obtain: 


Es. . (AV) Pa N 
32K, Keg Ate (Ca**) ae 
(45) 
2€ 
— — (AV) ie 
+ AK, A? e* (Ca**) Oe + (P-) —P,=0. 
y 
Similar to assumption (10), we assume that in the case of reactions 
(20)— (21) we have: 
adz—( Py. (46) 


Elimination of (P-) from (45) and (46) yields the following equa- 
tion for h: 


tay aX? 
32K, Koo At ext (Ca**) a? ( - a 
y? 
(47) 


Lay) h 
+ 4K. A? ek? (Ca*)a— +ah—P,=0. 
y 
We will now first show that the term ah may be neglected. To do this 
we have to find plausible estimates for the various parameters. F. C. 
McLean and A. B. Hastings (1935) find for body fluids: 


K,,o6Ko,. ~ 10°—10*. (48) 


Because we might expect Ki. << Ks, (secondary ionizations usually 
being “weaker” than primary ones), we estimate 
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Koo a 10?—105. (49) 


According to Webb and Young (1940) and R. S. Bear and F. O. 
Schmidt (1939) we have the following orders of magnitude: 


(K,) =3.107 M/1, (50) 
(Kz) =10? M/1. (51) 


Taking as an estimate of (Ca**) the value of that concentration 
in Ringer’s solution we find that the order of magnitude is (Fulton- 
Howell, 1946) : 


(Ga oO M/) (52) 
From (88), (50), and (51) we find: 
C= K, te Kz 
A ae 0.16 M/1. (53) 


Taking (AV) = 0 (no electric field applied), using (49) and (51) 
and keeping in mind that y is of the order of magnitude of K., we 
find for the coefficient of h in the second term in (47): 


4.10? (0.16)?- 10 (0.01)? a= 10’ a, 


which is large compared with the coefficient a of hf in the third term. 
We will, therefore, neglect the third term in (47). Equation (47) 
now takes the form: | 


£ =) +o = tet (54) 
y’ y? 


4E€ 
_. — (AY) 
G— ooh, Kha e7 (Ca**) a? (55) 


with 


and 
2€ 
b,=4K,, Pe (Cara. (56) 
The solution of (54) is given by 
h_ —b, + yb# + 4a,Po 


—_— co 


y? 2a, 


(57) 


or 
h=6By? (58) 
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bi paras ame 
Tee alta So es (59) 
PAD 20 Ay 


If we again assume as above that the conductance is a measure of 
the ionic permeability we may compare the theoretically derived re- 
sult (58) with the observation of Hodgkin (1947) who found that the 
conductance was increased roughly three-fold when the external po- 
tassium concentration was trebled. However, Hodgkin states that 
there is much scatter in the experimental results and he considers his 
estimate to be crude. The fact that this investigator also found that 
the conductance was reduced to about one-half by removal of potas- 
sium would suggest that a term h, , independent of y , should be added 
to the right-hand side of (58). This term has been omitted in our con- 
siderations for reasons of mathematical simplicity. In addition, our 
formulae (58) and (62) predict a definite dependence of this relation 
of the conductance and the external potassium concentration on the 
Ca** ion concentration and on an applied external electrical field. It 
would be interesting to compare these predictions with experimental 
data as soon as they are available to obtain some additional ideas for 
the improvement of this very crude theory. 


From (55) and (56) we have 


with 


-—=-— —; (60) 


Ps P, 
ee re : (61) 
82K oKo. Ate (Cat) a? 
Substitution of (60) and (61) into (59) leads us to: 
2€ 
—— (AV) 
B | Bret pie eens wham ce 
Ss j Bee ’ (62) 
K,(Ca“) | 16K,0 2? 


in which the index 0 has been omitted from the K’s. From (58) we 
see that h plotted as a function of the potassium concentration y gives 
a parabola (Fig. 9). From (58) and (62) we see that the perme- 
ability h decreases with increasing positive applied potential differ- 
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ence (4 V), while h increases with increasing negative applied po- 
tential difference (4 V). Furthermore, h decreases with increasing 
(Ca**), while h increases with increasing y, the potassium ion con- 
centration just outside the membrane. All these qualitative conclu- 
sions are in agreement with experimental evidence (R. S. Lillie, 1924; 
Hober, 1945). 


£0) 


y 


FIGURE 9. Permeability to potassium ions as a function of the potassium 
concentration just outside the membrane according to equation (58). 


Substitution of (39) and (58) into (35) gives 


at—y) — 4 
CONE an ee ea (63) 
dt Lb 
where 
2D 
k=—. (64) 
L? 
For simplification of notation let us write 
4B 
Say (65) 
L 
and introduce as new variables 
+ 
Sees e (66) 
2 
x ——_ 
ee (67) 
2 
so that 
g=utyv (68) 
and 
y=u—. (69) 


From (66), (33) and (38) we then have: 


210 MATHEMATICAL BIOLOGY OF EXCITATION 


Theale, 


Introduction of (65), (67), (69) and (70) into (63) yields: 


dv — = 
—=ke—kv—y(A—v)%. 
dt 


Rearranging (71) and putting 


Vv 
=—¢; 
A 
€ 
we 
A 
we find: 
dé 
— = k{2— (1+ $)é +26 2 —ce*), 
dt 
where 


y a? 
== 


is a dimensionless constant, and z now has a new meaning. 
Denoting the right-hand side of (74) by kF'(&), so that 


F(é) =2— Usp oye 25.365 


we have: 


ae ure) 
dt 


(70) 


(71) 


(72) 


(73) 


(74) 


(75) 


(76) 


(77) 


Because « > 0 and y > 0, we see from (33), (66) and (67) that v 
and, therefore, according to (72), €, is as great as possible if x is as 
large as possible and y as small as possible. Physiologically this oc- 


curs [(x + y)/2 =u being constant according to (33)] when 


y=0. 
Accordingly, using (66), (70), and (78) we find 
i= 2 7, 
This means, according to (38),x=K,+K,>K,. 


(78) 


(79) 
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Therefore, we see from (67), (78), and (79) that the maxi- 
mum value of v is 


A. (80) 
and that of €, according to (72), is 
sis (81) 


We now find from (76) that 


F(0)=z>0, 
F(1)=z—-1<0. 


Figure 10. Illustration of the relation between = and € for K, — 50 and 


K, = 1 mM/1 and for several values of 5. The permeability is given by (58) 
and (62). 


A plausible value of z is 0.96. According to (50) and (51), we have 
z= 0.94 , while in experiments on Nitella z = 0.96 (Osterhout, 1949). 
The function F(é) has been plotted against € for z = 0.96 and for 
several values of the parameter 6 in Figure 10. If 


3.65 < 8 < 6.85, (82) 


then F'(é) has three roots in the interval 0 < é < 1. Therefore, we 
see that there are three equilibrium values of € and hence of w and y 
also. As it can easily be seen from (74) or from Figure 10, the small- 
est root of F(é) corresponds to a stable equilibrium, the next larger 
one to an unstable equilibrium, and the largest one again to a stable 
equilibrium. As an illustration, we find that for 5 = 5, K, = 0.050 
M/1, and K, = 0.001 M/1 (Osterhout, 1949), these roots are 
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&, = 0.254 corresponding to y, = 0.019 and #, = 0.082 , (83) 
&. = 0.800 « > yo=0.005 ” a=0.046, (84) 
&, = 0.946 cs ” y,=0.0014” %=0.050. (85) 


The value y; gives the potassium ion concentration in M/1 just out- 
side the membrane in the original state. Adding slightly more potas- 
sium than y. — y; = 0.0036 (M/l) just outside the membrane does 
change the equilibrium state into y, = 0.019 (M/1). As we see here, 
Yo — Ys corresponds to a threshold of chemical stimulation through the 
local addition of potassium outside the membrane. We find here a 
value of the right order of magnitude (Hill and Osterhout, 1937). 
Because of the very crude assumptions which we have made, particu- 
larly the assumption of f = 1 in (26), we can hardly expect more 
than agreement of the orders of magnitude, and it is not worthwhile 
to go into the details here. However, at this stage we can estimate the 
value of f. As we have seen above, increasing the (Ca**) decreases 
6 according to (62) and, therefore, also, according to (65) and (75), 
5. As we see from Figure 10 decreasing § moves the two smaller roots 
toward each other until they coincide. At the same time the distance 
between the two larger roots, corresponding to the threshold, increas- 
es. For still smaller values of 5 than that one which corresponds to 
the coincidence of the two smallest roots, there remains only the 
largest root corresponding to the original state. This dependence of 
the threshold on the (Ca*+) might correspond to the experimentally 
observed increase of threshold (or decreased irritability) upon add- 
ing Ca* ions (R. S. Lillie, 1923; Hodber, 1945). Also, the original 
state might be restored (cf. Osterhout, 1933 for a restoration of ir- 
ritability by adding CaCl. solution). Adding more Ca* than corre- 
sponds to the smallest value of 5 , for which there are still three roots, 
might also lead to Ca narcosis. The quantity 8 increases with increas- 
ing applied negative potential difference, as is seen from (62). There- 
fore, according to (65) and (75), 8 also increases. Hence this mech- 
anism also gives parametric electrical stimulation by an applied cath- 
ode. To estimate f we will consider the case in which 8 is changed in 
such a way that starting with the situation 8 = 3.65 in which the two 
smaller roots coincide we arrive at the situation § = 6.85 in which the 
two larger roots coincide. The latter corresponds to zero threshold. 
We have then, from (62), 


2€ 
SENAY 6.85 
| ate es 65 wo 
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From the known values of « and k, taking T = 300°K , We find: 
f(AV)my=—8. 


Assuming that (A V) my = — 10, which is experimentally of the right 
order of magnitude (Cole and Curtis, 1940), we find 
jo 0 oe (87) 


Though higher than we might have expected, we find a value of f 
which is less than 1 for a reasonable value of the electrical thresh- 
old. 

Using this value of f we find for the “resting potential’ which 
corresponds to the original state, according to (85), a potential 
difference of the order 

50 < 10° 


98 (0.8) log —_———_—_-= 72 (mV), (88) 
14 x 10~ 


10 
which is of the right order of magnitude (Osterhout, 1949). 
Similarly, we may make an estimate of the “action potential.” 
From equations (83) and (85) we find the value 


32 X 10-8 50 X 1072 
ES 0 ae ee 
19 X 10-8 14 X 104 


which is also of the correct order of magnitude (Hill and Osterhout, 
1934). This would correspond to the depolarization of the mem- 
brane. 

Furthermore, the permeability increase corresponding to the 
transition from the original state to the final state is, according to 
(58), (83), and (85), 


19 X 10? \? 
—_—— | = 200. (90) 
14 X 10+ 


This is also the right order of magnitude (Cole and Curtis, 1938). 
However, it should be realized here that this is only a very crude re- 
sult; the range of validity of our approximation should be investi- 
gated. Particularly, the neglect of the third term in (47) is no longer 
justified; however, the order of magnitude probably does not change 
significantly due to this approximation. 

It might be interesting to show here a possible way in which the 
one-factor theory may be obtained from the above considerations. 

We shall show this also as an example for the particular case 


)=-5 (mV), (89) 
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3 =5. From the curve in Figure 10, corresponding to this value, we 
have the equation 


&=k(0.96—6& +10 &—5 &) 
or 
—&=—5k(é— 0.254) (€ — 0.800) (€ — 0.946). (91) 


We then have for the slope at the value = 0.946 , corresponding to 
the original equilibrium state, 


dé 
es ) Seni 
dé J §=0.946 


Therefore, approximating the curve given by (91) by its tangent at 
the point (0.946, 0), we have from Taylor’s theorem, keeping only 
terms linear in &: 


E=é,.,, —0.5k(é— 0.946). (92) 


The initial value of €, &:ni:., due to stimulus or “impulse,” is 


t 


are == j E dt=C, (93) 


where C is a constant, which, however, depends on the time course of 
the stimulus. 
Substituting into (92) for é its value from (72) and (67), using 


eS > 
2A 


and keeping in mind that according to our approximation (33) 


a+y=0, 


we find: 


y =C,—0.5k(y — 0.0014), (94) 


where C, is proportional to C and 0.0014 M/1 is the value of y in the 
original equilibrium state. From (94) we see that our k turns out to 


have the order of magnitude of the time factor of the one-factor 
theory. 
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FIGURE 11. Chemical threshold of (K+) in mM/] as a function of the para- 
meter 6 for the case K, — 50 and K, = 1mM/1. 


The threshold can be determined as a function of 5 from the 
curves in Figure 10. The relation obtained is plotted in Figure 11. 
From this curve and equations (62), (65), and (75) we can obtain 
the dependence of this (chemical) threshold on the Ca* ion concen- 
tration and on the applied (4 V). Assuming (4 V) = 0 and § = 8.65 
we obtain in the above outlined way: 


(Ca**) 
in fractions of Chem. Threshold 
(Cat), in mM/1 (K+) 
0.285 0.00 
0.300 0.50 
0.333 1.20 
0.400 2.25 
0.500 8.50 
0.600 4.50 
0.667 5.20 
0.833 7.20 
0.900 8.00 
1.000 10.00 


This relation between the chemical threshold and (Ca**) is shown in 
Figure 12 in which the curve for § = 5 has also been drawn. 
In a similar way, which has also been outlined above, we can 
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FiGcurE 12. Chemical threshold of (K+) in mM/I as a function of 
(Cat) /(Ca*) , for K, = 50 and K,=1mM/1. 
Two curves are shown, one for the initial value 3.65 for the parameter 5 (cor- 
responding to Ca++ “narcosis’’) and the other for the initial value 5 of that para- 
meter. (Cat), is the value of (Ca++) corresponding to those values of 6. To 
simplify the calculations we have chosen for 8K,P, >> K.- 


calculate the dependence of the threshold of chemical stimulation on 
the applied field (A V) which gives us the following table if we as- 
sume the electrical threshold to be —5 mV: 


8 


' 2 3 4 5 > -(V)n mV 


FIcuRE 13, Chemical threshold of (K+) in mM/1 as a function of an applied 
voltage for K, = 50, K, —1mM/1, 6, =5. The symbol 5, denotes the value 


of the parameter 5 when (AV) = 0. The electrical threshold is taken to be 
— 5 mV. 


GEORGE KARREMAN 217 


(AV) Chem. Threshold 
(in mV) in mM/] K+ 
0 3.80 
—1 3.15 
—2 2.42 
a 1.66 
—4 0.86 
=-5 0.00 


This relation between the threshold and (A V) is shown in Fig- 
ure 13. 


It would be interesting to compare these results with experi- 
mental data. As far as we know these are not yet available. The elec- 
trical threshold can be expressed as a function of the Ca* ion con- 
centration. From (62), (65), and (75), we find, after some simple 
rearrangements: 


C 
—1+,/1+ 
(Ca**) 


ape rag 
meee AY.) ig 0, (95) 
kT ~ (Cat+) kT n 6} 
—1+ | 1+ 
(Ca**) n 
in which (AV); is the electrical threshold corresponding to the 


(Ca++) 
particular (Ca**) ion concentration, while (A V)m, and (Ca**), are 
n 
the corresponding normal (or, as we may assume, initial) values, and 


C is a constant equal to 
8K,P, 
(Gea 


We can then determine (A V) tnr from: 


2€ 
— f (AV) thr 
n 


6.85 = 5, e * (96) 


8, being the initial value of 6. 
Taking as a plausible value of 5, = 5 we find, with f = 0.8, 


(A V) thr rs 5 mv. 
We then obtain from (95) the ratio 
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(A V) thr 


(Ca++) 
(A V) thr 
n 


as a function of (Ca**). The graph of this function (Fig. 14) may 
be compared with the corresponding one of F. Brink and others 
(1946). Though the trend is correct, our range of (Ca**) values in 
which the threshold changes from 0 to its normal value is much 
smaller than the one found experimentally. This might perhaps be 
explained by the fact that local stimulation may be more sensitive 
than general stimulation or that we have not considered any accom- 
modation effects. The discrepancy is, however, probably due to the 
erudeness of the theory e.g., the neglect of permeability of Ca* ions 
and of diffusion potential of Ca* ions. 


FIGURE 14. Electrical threshold expressed in fractions of its initial value as 
a function of the ratio (Ca++) /(Ca*+),, for the initial value §, = 5 , K, = 50 mM/1 
and K, = 1mM/1. The initial or normal value (Cat), of (Ca++) corresponds 
to 6, and the normal threshold. In the same graph the experimental values, 
indicated by x and an experimental curve (Brink and others, 1946) are shown. 


Unfortunately, the duration of the “action potential” given by 
the integral 


0.254 d £ 
wg (§— 0.254) (€ — 0.800) (E — 0.946) 


for the case § = 5 is much too large. A slight overshoot of the un- 
stable root 0.8 had to be taken to keep ¢ finite. However, the integral 
in (97) remains infinite as long as 0.254 is taken as the upper limit. 
Therefore, the latter also must be slightly shifted. Presumably a 
recovery process takes place which is superimposed on the considered 


GEORGE KARREMAN 219 


process and keeps ¢ finite. The infinite value of ¢, as obtained from 


(97), is due to the fact that € = 0 at the roots. A similar phenome- 
non is discussed by Rashevsky (1948, p. 347) in the two-factor the- 
ory of conduction. 

Another interesting dependence of the threshold is that on K. ; 
the level of the external potassium concentration. This, however, will 
be postponed until after a more general analytical treatment is given 
below. Qualitatively it can be seen already, however, that an increase 
of K,, which decreases the quantity 


shifts all the curves in Figure 10 toward smaller values of the ordi- 
nates. It, also, therefore, decreases the difference of the two largest 
roots of F'(é), (the threshold), which is in agreement with experi- 
mental evidence (Hober, 1945). 

On the basis of the value 0.8 for f we can estimate the electrical 
field strength corresponding to an applied electrical potential differ- 
ence of — 10 mV. Even if we assume the latter across a distance of 
1 A we find 8 X 10° V/cm for the field strength. This does not com- 
pare too unfavorably with an estimate of Osterhout (1934) of 
2 X10* V/mm or more. It would be only slightly worse in this respect 
for f =1. We see that this model gives some results which are in 
agreement with experimental facts, at least as far as the order of 
magnitude is concerned. It is interesting to point out that these re- 
sults, which give absolute values, were obtained by using the abso- 
lute value of the unit electric charge and the Boltzmann constant. 
However, the fact that the value of f is found to be different from 
unity indicates that a more general treatment is necessary. The pow- 
er of y in (58) should be made equal to nf = p instead of 2 where 
n is the charge of the ion bound to the P- ion. We have hitherto iden- 
tified this ion with Ca**. To illustrate this more general treatment and 
to make some use of it, we proceed as follows. Taking, instead of 
(58), 

h= By’, (98) 


we obtain, instead of (74), in the same way as before: 


gE=h {2z—&— B(1—&)" 4}, ( 99) 


where the parameters k, z, and f have the same meaning as above, 
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cf. (64), (73), and (62) respectively. 
Putting, as before, 


é 
G (8) ef ies (100) 


we will determine here as functions of p the critical values 6, and 
B. of B for which the two smaller roots in Figure 15 and the two larger 
Ke) 
09 
08 
07 
06 
0.5 
04 
03 
02 


Ol O02 03 04 05 O06 07 08 09 10 
FIGURE 15 


roots respectively coincide. These values of § have given us before, 
for the case p = 2, the order of magnitude of the electrical threshold. 
Introducing as a new independent variable 


n=1—é, (101) 
we have 
G(é) =H(n) =z—-1+y— By? (1—7) 
102 
eines euky/ Meare Leach IE ee 
in which 
e=1—<. (108) 


The curves 
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E=kG(é) 
or 
= — kH (n) 
are tangent to the & or 7 axis respectively, if: 
H (ny) =0 (104) 
and 
H'(n) =1—p By + (pt+1)B yr=0. (105) 
The latter leads to: 
1 
s= (106) 


pn?*— (p+1)x 


Substitution of (106) into (104) yields the following quadratic equa- 
tion in 7 using (102): 


pn? —n{e(p+1) + (»—1)}+ep=—0 (107) 
the solution of which is: 


e(pri1) + (p—1) = {e(pt1) + (y—1) }*—4ep’ 
Ute = MEME en oh aS ag 
It can easily be seen that the expression under the sign of the radical, 


(pee) eo = 2p) et 1) 2770 


(108) 


if 
ee Piagh) (109) 
(pr L)? 


From (106) and (108) we find the critical values 6, and 2 for which 


we are looking. 
In this way the quantities §, and #6, and, therefore, of 


can be expressed as functions of e and p. Because, according to 
(103), (73), (39), and (38), 


222, MATHEMATICAL BIOLOGY OF EXCITATION 


mV 
=n Ge G 


— ELECT. THRESH. IN 
act 


o-N wea YY oO O 


& 


i is 2 2528 {K*) 
a 


FicurE 16. The negative of the electrical threshold as a function of 
(K,*)/(K,*),- For reasons of mathematical simplicity it is assumed that the 
original state corresponds to “narcosis.” K, = 50 mM/1 (K,*), was actually 


taken in this illustration equal to 1 mM/l. (K,*)/(K,*), has been plotted on a 
logarithmic scale. 


oV' 
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FicurE 17. Applied potential (AV) vs. membrane current intensity J. In 
order to make these calculations it was necessary to assume that the perme- 
ability approached 1/7 of its normal value [when (AV) = 0] for small K+ ion 
concentration and that in series with the membrane there is an additional re- 
sistance 1/12 of that due to the resistance of the normal membrane. 
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. . 32 
we can calculate in this way — and, therefore (A V) as a function of 


5 
K; for a fixed value of p, e.g., p= 2, and for a fixed value of K,, using 
the relation 


3, (110) 


obtained from (62) together with (65) and (75). The result of this 
calculation for K, = 0.050 M/1 is shown in the following table: The 
calculations were made for a Ca* concentration which corresponds to 
calcium “narcosis.” This particular case was chosen only as an illus- 
tration. 


K, (in M/1) —(AV) (in mV) 
0.0005 20.1 
0.0010 10.4 
0.0015 5.8 
0.0020 1.7 
0.0025 0.6 
0.0028 0.0 


where f is again taken as 0.8. 

This relation is shown in Figure 16.* 

Furthermore, we can derive from (62) and (58) the permeabil- 
ity h and, therefore, the resistance for negative as well as positive 
(AV) (Fig. 17). This may be compared with Figure 5 of K. S. Cole 
and H. J. Curtis (1940). The difference may be explained by the fact 
that we have neglected accommodation effects. On the other hand, it 
might be interesting to point out here that in principle our formulae 
predict influences of (Ca**) and K, on these rectification curves. 

We shall show here how in principle the so-called time-course of 
the excitatory disturbance and “local non-conducted response” (Katz, 
1937) can be obtained from our mechanism. To illustrate this let us 
again take, only as an example, the curve in Figure 10 for 5 = 5, 
which is given by equation (91). 

Denoting by 


C—— (é— 0.946) (111) 
we find from (91) 


*The linear relation between the electrical threshold and the external po- 
tassium ion concentration for low K, has recently been observed by Howard 


Jenerick (personal communication). 
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— t= + 5k (0.692 —C) (0.146 —L)¢. (112) 


The graph of i given by (112) against ¢ as shown in Figure 18a. 
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FIGURE 18a. The “local phenomenological characteristic” for K, = 50 mM/1, 
K, = 1 mM/1 and $6 =5. 


This corresponds to the “local phenomenological characteristic’ 


of Weinberg (1942). Weinberg’s instantaneous time constant, k, is 
‘plotted in Figure 18b: 


a et SS — 
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FIGURE 18b. The “instantaneous time constant” for K, = 50, K, = 1 mM/1 
and 6 = 5. 
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FIcuRE 19. a. Graph of standardized excitability p = 116 


of kt. b. “Local response” for positive initial disturbance p, as a function of kt. 
c. “Local response” for negative initial disturbance Pp, aS a function of kt. d. 
Maximum height of the “local response” as a function of p,. This figure is based 
on K, = 50. mM/1, K, —=1mM/landd=5. 


¢ 
Bee Ges) (0.692 —¢). (118) 
Integration of (112) leads us to: 
p ih, — 
ki-——— 0.46! 9.9 log —— 2.65 log ——__——_ — 12.5 log , (114) 
Po 4.74 — po La Po 
where 
's s 
a (115) 
0.146 0.146 


K. 
It may be well to point out that for a fixed value of — the coefficients 
'2 
in (114) are determined by only one parameter 5 which was assigned 
the value 5 in this example. 
For several values of po the relation between p and (kt) given 
by equation (114) is plotted in Figure 19a. These curves show a 
striking similarity in all their characteristic features with the curves 
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of Katz (1937, p. 252), which were obtained by electrical shock stimu- 
lation of nerve. In Figures 19b, c and d the time-course of the so- 
called “local response’ and the maximum height of the “local re- 
sponse” respectively are plotted, the latter as a function of the stimu- 
lus. This should hold for chemical stimulation. Experiments with 
which these curves could be checked do not seem to exist yet; at least 
they are unknown to this author. It would, therefore, be very inter- 
esting to attempt to obtain these curves by chemical stimulation of 
nerve or Nitella, which may be easier experimentally due to smaller 
speed of response. However, in Figure 20 the theoretical curves for 
‘ 
mt = 30 and § = 4 describing the time course of the chemical excit- 
2 
ability are compared with the corresponding curves of Katz (1937) 
which describe the time course of the electrical excitability. Figure 
19c indicates the presence of a “local response” for values of y which 


20 


as 


kt 


) 5 10 1S 
FicuRE 20. Comparison of experimental results (Katz, 1937) with theo- 


retical curves, which were obtained for —* — 30 and 5 = 4 and are described 
by the equation: ; 
p 1—p 2.7 — p 
kt = — 5.82 log,, — + 8.45 log — 3.18 lo : 
oe ee ae £10 2.0 pe 


are lower than the concentration of potassium in the resting state. 
These lower concentrations correspond in the case of electrical stimu- 
lation to anodal shocks for which no local response is supposed to 
occur (Katz 1937). Experimental results of Katz (loc. cit.) indicate 
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the same thing. This corroborates a remark made by Weinberg 
(1942). As Weinberg (1942) has pointed out so aptly, the only thing 
which can be concluded from the experimental results is that the 
phenomenon cannot be described by linear equations. The so-called 
“local response” which is found for cathodal as well as anodal stim- 
uli is nothing but this deviation from linearity and cannot be inter- 
preted, strictly speaking, on the basis of a phenomenological (Katz, 
1937; Rushton, 1937; Weinberg, 1942) theory as had already been 
pointed out by Weinberg (1942). The theory proposed here, based 
on a physical mechanism of diffusing potassium ions reacting chemi- 
cally with a CaP, layer, crude as it is, explains these deviations in a 
plausible way. It predicts that the time constant should be propor- 
tional to the square of the intensity of large anodic stimuli. This 
has not been considered by Weinberg (1942). Actually, the instan- 
taneous time constant of the experimental curves for anodic stimuli 
(Katz, 1937) do depend on the intensity of those stimuli. However, 
we have not been able to verify the above-mentioned predicted asymp- 
totic dependence. The reason might be that the experimentally used 
intensities of the anodic stimuli are not large enough for that purpose. 

If the response of the nerve is due to a mechanism such as dis- 
cussed above, then the so-called “threshold” in Figure 5 of Katz (loc. 
cit.) is no threshold at all. This is seen from our Figure 19b, which is 
probably more accurate than Katz’s figure. Our figure is based on 
equation (114). The “local response” for pp = 0.5 may not be so 
slight at all, as shown in our Figure 19b. Therefore, for this phe- 
nomenon, relation between maxium height of “local response” and 
intensity of initial stimulus, there seems to be no threshold. On the 
basis of our theory the real threshold phenomenon would be more 
hidden; the “local response” derived from it only appears “graded.” 
If experimental results for chemical stimulation confirmed these pre- 
dictions, it seems to us that those results, together with those of Katz 
(1937), would suggest a common underlying physical mechanism for 
excitation by chemical and electrical stimulation. The physical mech- 
anism proposed here, crude as it is, might then be a plausible one, 
since it is based on the flow of ions which must occur in some way or 
another in electrical stimulation and may also occur in chemical stim- 
ulation, as shown in this work. 

It might be interesting to point out here that from the formulae 
above a dependence of the curves in Figures 19a, b, c and d on, e.g., 
Ca++ and K, may be derived in principle. As soon as experimental 
data are available it would be useful to derive those dependences and 
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compare them with the experimental results to obtain some more 
clues for improving the theory proposed here. There are obviously 
several possibilities for this. 

Thus what may appear to be a graded local response may ac- 
tually be due to a “local threshold.” 

Therefore, we would like to end this part of our work with the 
conclusion that we do not necessarily have to assume that a threshold 
for a longitudinal gradient [e.g., the longitudinal polarization, cf. 
Hichler (1933; Katz, 1939) ] is the only part of the threshold essential 
for excitation, but that the latter may very well be due to a local 
threshold phenomenon as has been assumed in this work. 

Outline of an approximate method for treating the diffusion of 
ions through the membrane. We will in the remaining part of this 
paper consider a membrane with a finite thickness 4. However, with 
a rather good approximation, this may be considered to be small com- 
pared with L, if we take the plausible value 10-* cm for L, since A 
is of the order of 10-* cm (Davson and Danielli, 1943). (See Figure 
21.) 


Ae 
(MEMBRANE) 
FIGURE 21. Illustration of approximate treatment of the diffusion of K+ ions 


through the membrane. The numerals I and II correspond to the vertical “bound- 
aries” in this figure. 


To simplify matters we will assume that we may take an average 
concentration y of the potassium ion in the membrane. Actually we 
should, of course, consider a gradient of the potassium concentration 
inside the membrane corresponding to the slanted dotted line between 
I and II in Figure 21. 

Denoting by x and z respectively the concentration of potassium 
ions at I and II, we find in a similar way as above a description of 
the system by the following set of differential equations: 
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Ii: D, K 

Se im — hf’ — 5 

- L, ( x) (x — y) (116) 
Ay=h (x—y) —h" (y—z), (117) 
Le a h” ( D, 

—2z=h" (y—z) —— (z—K.,), 11 
5 Y= 2) f. (z ) (118) 


in which h ’ and h” are the permeability to potassium ions of I and II 
respectively. 

We are particularly interested in seeing if it is now possible to 
get a relaxation oscillation with threshold property if we take a 
linear dependence of the permeability on the potassium ion concen- 
tration. The reason for our interest in relaxation oscillations will be 
clear from the first part of this paper. The linear dependence of the 
permeability upon the potassium concentration follows, for suffici- 
ently small potassium ion concentrations, from the mechanism illus- 
trated in Figure 3 and described by the corresponding equations 
(2)— (16), as shown by Figure 4. The above mentioned experimen- 
tal relation between the conductance of the membrane and the ex- 
ternal potassium ion concentration found by Hodgkin (1947) is also 
more in line with such a dependence than (58). In addition, it would 
be of some interest if it could be shown that it might be possible to 
get a threshold phenomenon with such a linear dependence leading to 
a concave curve as in Figure 4. In other words, to show that a con- 
vex curve is not necessary. 

Therefore we will now assume 


hs :Cia, (119) 
i Cook (120) 


However, before introducing (119) and (120) into (116), (117), 
and (118) we will reduce the latter system to two first-order non- 
linear differential equations. It would be very advantageous if it were 
possible to solve the set (116), (117), and (118) using such simple 
assumptions as in (119) and (120), but we have not been able to do 
so. The main difficulty is that the set (116), (117), and (118) con- 
sists of three nonlinear equations. Such third order systems have 
been solved, as far as we know, only in a few special cases (Fried- 
richs, 1946; Rauch, 1950). The reduction to a second-order set men- 
tioned above is done in the following way, using our assumption about 
the smallness of 4 relative to L, and L.. Solving (117) for y gives: 
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hz+h'z—a 
y= sibel, Sina acd (121) 
hk 
Substitution of (121) into (116) and (118) yields, after some ele- 
mentary rearrangements, 


Ty - D, (K ' Wh" ( h' ; . (122) 
—x=—— (K,—2) — x—2z)— y 
2 L, h' + h” h' + h” 
and 
Le hh’ b, u” : 
— z= (a —z) —— (2e—K,.) — AY. (128) 
Pe! italia h' + h” 


It is interesting to note that in the latter system the harmonic aver- 
ages of the permeabilities h’ and h” of the two layers I and II occurs 
essentially as effective “all-over” permeability of the whole membrane, 


if we neglect the terms with y because of the smallness of 
A A 


— and —. 
1 


Now introducing (119) and (120) into (122) and (123) we find 


L, be D, (K ) C,Csx2 ( ) 1 Ci" a 
Se hae Some Uit ) a eae eR a ey) a 124 
2 Ly Ct = Caz C32 ae Coz Md ( 
and 
i CiC.xz ( ’ ey K) C.z . 
— z= —— _ (x — 2) ——(z— SY ee 125 
2 C,x + C22 Ly ; Cit +Gee 


1 


Lis . Le 2 dD, Ds DD Dz : 
—£ + —Zz2=|( —K,+— Ke} — [| —2+—2)—Aay. (126) 
2 Ly Le L, 


Subtraction of (125) from (124) leads to 


ie Des « D, De D, D, 
— «4% ——z=| — K,—— kK, ) — |—a«——z 
ee ie L, i In L, 


2C,C2%z Cix — Coz ° a 
— ——__ (4¥— 2) —1 ———_-y 
Ci% + Caz C2 + .C.z 
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We now introduce the following new variables: 


ie eed 

2 =?¢, (128) 

TZ 

oun (129) 
so that 

L=pry (130) 

Ua Ow (131) 


We now find, instead of (126) and (127), 


dset Lee  f,—[5°- dD, Ds D, De\ ¢ 
o + Ba ec (cca) 


2 2 L, L, I, Ly 
D. D, (182) 
= SF — Se /h ¥ 
( jy ae )e : 
1 Fell OR L, + Le S D, D, dD, Dz 
ot wr=| — K,—— kK, )—( ——— 
2 2 ( : a & =) 
Diewa D 4C,C.(¢? — y? 
(g?—y)y (133) 


eh | oes Pee a cea ee a RTE os OM 
G ie (EET STO: 
(C,—C.)¢ + (C1 + Cz)y - 
(Ci + C2)¢ + (C,— C2) yp 


Although unnecessary, we will make the following simplifying as- 
sumptions in the remainder of this paper: 


Lila Ls, (184) 

C,=—=C,=C. (1385) 

Furthermore, for simplification, we shall introduce the following 
notations: 


dD, Dz 
—K,+—k,=24A,, (136) 
L Lt 
D, Dz 
—— K-——K,=2.4,, (137) 
L? L? 
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D; 
BED ery (138) 
L? 
Dz 
BE AeA Ae (139) 
L? 
kntkh,=a, (140) 
k—kh=8, (141) 
A 
—=7, (142) 
L 
2C 
—_—>Yy. (1438) 
L 


Introduction of (1384)—(143) yields the equations: 
¢ =24A,—a¢—Py—r4, (144) 
e wy yp ° 
p=2e—po—ayp—y(¢?—y?) Seba Aa (145) 


For the special case considered above—(134) and (1385)—we obtain 

from (121), after introducing into the latter (119) and (120) and 

differentiating, 

oy o(¢?—y) + p2oy 

= 5 oars aia d 

if we neglect powers of 4 higher than the first one. 
From (144), (145), and (146) we obtain the following equations 


linear in $ and y: 


Ay (146) 


poy aN. yp: 
an i Jar ert pe day By, (147) 


( Sree 2) x 2 2 

at (14208 wean sp—ay 
148 

Li a ayy ( ) 


$ 
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These equations can be solved for $ and y. The solutions are of the 
form: 


Se geet a pe Ae a), (149) 


p=G(o,v.7,0,8,7,41, 4). (150) 


In principle we can obtain from the set (149) and (150) a sec- 
ond-order equation e.g. in y by elimination of ¢. To make use of that 
equation we must introduce drastic approximations. Explicit calcu- 
lation of F in (149) leads to: 


- 2A a B ie ' (151) 
co) L L° L Warg@ls Goa odls 


Neglecting as a zero approximation the terms in + in this equation, 7 
being small according to our assumptions, we obtain: 


oy a8 B 
Go L Det (152) 
Assuming f/a << 1 we have 
oer (153) 
L L 


which is similar to (81). 
As in (31) we find as the steady state value for ¢: 


2A; 


o= =A, (154) 


a 


which we will keep as our first approximation for ¢. 


Introducing 
is (155) 
A, 
eee ; (156) 
A, 


we find from (154) 
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We obtain for € an equation of the form: 
; P;(n, 
= Poly, 6) é (158) 
P, (n e) 
in which P;(7, &) and P3(7, €) are polynomials of the order 5 and 3 
in € respectively. 
By differentiating both sides of (158) we obtain 


oP; Ors Oulzs oP: 


4 a& x Cie ua 3 an - 
pi tte ee (159) 
P? Be 


Substituting in this equation (157) and an approximation for n high- 
er than that corresponding to (157), we finally obtain: 


E+o(é)& + (é)=0. (160) 


At the present stage of development, both experimental and theo- 
retical, it is hardly worthwhile to derive explicitly the expressions 
for ®(é) and ¥(é). Though this field of mathematics is momentarily 
in a rapid rate of progress, as shown by recent work (Andronow and 
Chaikin, 1937; Minorsky, 1947; Stoker, 1950; Lefshetz, 1950; Kry- 
loff and Bogoliuboff, 1934), the theory for the case in which #(é) 
and ¥(&) are both of higher order in & than the first does not seem 
to be in a form that is ready for application. In a former paper 
(Karreman, 1949) the author showed (as mentioned in the first part 
of this paper it had been shown earlier) that in the case in which 
WY (é) is proportional to €, a relaxation oscillation with threshold can 
be obtained when ®(&) is at least of the fourth degree and @(&) = 0 
has at least two appropriate positive and two appropriate negative 
roots. Although in our present case ¥ = W(&) is much more compli- 
cated, even in the lowest approximation, we still believe (although we 
have no proof) that for a self-sustained oscillation it is necessary, in 
general, for the function 6 = ®(&) to be negative for sufficiently long 
intervals of positive values of € and also negative values of é. It is 
interesting now to note that for the special case treated above [see 
eqs. (184) and (135)] #(&) can be shown to have for small + the fol- 
lowing form: 


6(é) =h((1 4+») (14 27) b27—B8a2) Ee 


—[But+27(24+5n)] & +4708 (161) 
+4ruE 3/1471 +&)p. 
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The dimensionless constants are given by 


Ae 


u ; (162) 
a 
€1 
me ee) 
B 
ct omet (164) 
a 
k=a, (165) 


together with (136), (187), (140), and (141). 

It is easy to see that ®(€) = 0 has for small 7 two positive and 
two negative roots and has a form similar to that shown in Figure 22, 
if wis sufficiently large (z and az being both less than 1, the latter only 
in absolute value). 

Depending on the relative position of the four roots &,, é, &, 
and &, the situation shown in Figure 22 may lead to the following 


two cases: 


FIGURE 22 


1. Only a damped oscillation occurs below a certain “critical” 
finite disturbance (corresponding to a threshold), if none of the re- 
gions of negative damping is reached by it from the original equilib- 


rium state. .s 
2. If the disturbance, above a certain threshold, from the origi- 


nal equilibrium state is such that at least one of the regions has been 
reached, two things may happen depending upon the circumstances: 


a. The regions of negative damping are so far apart that even 
if self-excited in one of such regions the & is damped so much in the 
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intermediate region that it does not reach the other region (cf. Karre- 
man, 1949, where it was shown that in a similar situation; namely, 
of one region of negative damping, the oscillation was always damped). 
This would correspond to one spike. 

b. The regions of negative damping are sufficiently close so that 
an undamped threshold oscillation occurs, corresponding to repetitive 
discharges. 


In this way we see that a mechanism such as proposed here might 
account for a spike as well as an oscillation, both with threshold 
properties. It is interesting to note that the parameter yu, the relative 
order of magnitude of which was found to be important above, is 
determined by the Ca* ion concentration. Rough estimations, tak- 
ing into account the order of magnitude of the values of a, Po and 
the equilibrium constants K, and K, in (16) and (14) show that C; 
in (119), (120), and (135) is approximately inversely proportional 
to \/ (Ca**). Because of (143) and (162) the same holds true for uw. 
We noticed above that uw has to be sufficiently large, particularly if 
we realize that the four roots discussed above have to be within the 
physiological limits, which cannot be so easily obtained any longer. 
Therefore, theoretically we might be led to expect relaxation oscil- 
lations for a sufficiently small value.of Ca*+, which is in agreement 
with experimental evidence (Brink and others, 1946) in the case of 
nerve fibers. 

Discussion. In a very simple way an attempt has been made to 
investigate quantitatively the consequences of structural changes in 
a membrane due to chemical reactions of K* ions and a layer consist- 
ing of CaP. compound through which K* ions diffuse. On the basis of 
these chemical reactions and a “hole” concept, a quantitative defini- 
tion of the permeability of the membrane, which is supposed to have 
as an essential element such a CaP, layer, to K+ ions is given. Several 
possibilities for this have been indicated. One, namely that the per- 
meability of the membrane is proportional to the sum of the concen- 
trations of the formed KP and of the P-, is elaborated. Equations 
describing the diffusion of K* ions are given with the aid of the ap- 
proximation method for the case in which the membrane is supposed 
to have finite thickness and at one or both sides such a CaP, layer. 
In the last part of this paper we have indicated that this explicit con- 
sideration of the finite thickness of the membrane leads to nonlinear 
differential equations which are not amenable to solution at the pres- 
ent time. However there seems to be some indication that they may 
lead to equations describing relaxation oscillations with threshold 
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properties in qualitative agreement with experimental facts, for in- 
stance, as far as the dependence of (Ca*+) is concerned. If it is pos- 
sible in the future to determine the frequency dependence it may be 
that theoretically observed properties such as frequency demultipli- 
cation and dependence of the frequency on (Ca**) can be determined. 
However since this is a program for the future, a similar mechanism 
based on the ionization of a CaP, layer has been assumed in the cen- 
tral part of this paper as responsible for the determination of the 
permeability of a membrane whose thickness has been neglected for 
reasons of mathematical simplicity in the equations describing the dif- 
fusion of K* ions through it. At the same time some electrical effects 
due to an applied external electrical field and a “resting potential” 
across the membrane, using for the latter a diffusion potential of K+ 
ions, on the chemical reactions of the CaP, layer are taken into ac- 
count. For mathematical simplicity other electrical effects such as 
electrical forces on the diffusing K* ions, the effects of other ions (e.g., 
Ca**) on the potential across the membrane, as well as the diffusion of 
other ions (e.g., Ca** and anions) have been neglected. These omis- 
sions, particularly the first two, are thought to be serious shortcom- 
ings. On the basis of such a mechanism the permeability of the mem- 
brane to K* ions is determined. Explicit relations for the dependence 
of the membrane permeability on an externally applied electrical field, 
on the Ca* ion concentration, on the average K* ion concentration level 
have been derived in principle. The dependence of the permeability to 
potassium ions on the K* ion concentration is used in the differential 
equations describing the diffusion of the K* ions through the mem- 
brane. The latter equations are solved under the assumption that the 
average K+ ion concentration has its steady state value. The solution 
shows threshold properties under certain circumstances. Estimations 
of “resting potential,” threshold (chemical as well as electrical), “ac- 
tion potential,” permeability increase, and electrical field across the 
CaP. layer are made and shown to have the correct order of magni- 
tude. A possible way to derive the one-factor theory is outlined. Also, 
the time course of the excitatory state for various intensities of an 
initial chemical stimulus has been derived theoretically from the pro- 
posed physical mechanism. The curves obtained show all the char- 
acteristic features of the corresponding ones for electrical stimula- 
tion. It is pointed out, therefore, that these results suggest that re- 
sponse to both electrical and chemical stimulation is based on a com- 
mon threshold phenomenon as proposed here, which may very well 
be a local process. Furthermore it is pointed out that the dependence 
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of the latter curves on external parameters such as the Ca* ion and 
the external K+ ion concentration may be determined in principle. A 
more general treatment based on a more general dependence of the 
permeability to K* ions on the local K* ion concentration is given and 
applied to derive the relation between the electrical threshold and the 
K+ ion concentration on the outside. Other applications to derive rela- 
tions between the chemical threshold, the Ca* ion concentration, and 
the applied electrical potential are made. The dependence of the elec- 
trical threshold on the Ca** ion concentrations has been derived theo- 
retically. Comparison of the result obtained with the corresponding 
experimental result shows that the former gives a more sensitive de- 
pendence of the threshold on the Ca* ion concentration. This may 
be due to the fact that no accommodation effects have been consid- 
ered theoretically, that the theoretical result refers to local stimula- 
tion, and that the threshold derivation neglects the diffusion of Ca*+ 
itself as well as the influence of Ca** on the potential difference across 
the CaP, layer considered. Also a current-voltage characteristic has 
been derived and is compared with the one experimentally obtained. 
The differences are ascribed to accommodation effects. The possibil- 
ity of obtaining the dependence of these curves on Ca* ion and ex- 
ternal K+ ion concentration is mentioned. Additional relations which 
may be predicted on the basis of this theory are the increase in per- 
meability to K* ions during activity as a function of the external K+ 
ion concentration, of the Ca* ion concentration and of the applied 
electrical potential (A V) (cathodic as well as anodic). These pos- 
sible extensions all call for more experimentation along these lines. 


Though there is a possible manner in which metabolic reactions 
may be taken into account, namely, through a change in P, in equa- 
tions (8) and (44), they have been neglected above because it was 
supposed that the metabolic reactions may be assumed to be relatively 
slow compared with the ionic reactions. Therefore, recovery processes 
have not been taken into account. It may be worth mentioning here 
that a mechanism as proposed by E. Eriksson (1949) might perhaps 
be applied here, the breakdown of the CaP, layer as obtained here giv- 
ing an increase in the P~ anion concentration. 


The duration of the development of the action potential could 
not be determined theoretically on the basis of this mechanism. It 
seems plausible that the mechanism outlined for obtaining relaxation 
oscillations might be better in this respect. However, it should be 
stressed here that the considerations about the relaxation oscillations 
in the latter part of this paper are only suggestive and might be a 
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starting point for more detailed investigations, complicated as they 
are. However, they are by no means conclusive and, as will be obvi- 
ous from the outline given above, they constitute no proof whatso- 
ever. Particularly this holds as long as the physiological limits and, 
especially, the range of validity of the approximations made have not 
been determined yet. 

Finally, we would like to point out that while we have tried to 
give an underlying physical picture for changes in permeability to K+ 
ions, formally similar mathematical considerations may describe other 
dependences of the permeability on the K+ ion concentration or the 
same, or similar, dependence based on another picture. Furthermore, 
it might be that similar considerations may apply to ions other than 
the ion we called the K* ion. 


Summary. 


1. On the basis of the assumption that an essential element of 
a membrane is a CaP, layer, an expression for the permeability of the 
membrane to K* ions diffusing through it and reacting chemically 
with it has been derived theoretically, taking into account some elec- 
trical effects due to the diffusion potential of the K* ions and an ex- 
ternally applied electrical potential difference on the chemical reac- 
tions. 


2. A relation between the permeability to K* ions and the con- 
centration of the K+ ions outside the membrane has been derived, 
which shows the influence of some factors [Ca* ion concentration, 
externally applied electrical field (A V)] on the former. 


3. A threshold of the right order of magnitude is shown to 
exist for the amount of K* ions locally applied to the membrane in 
order to excite. Also a threshold for parametric electrical stimula- 
tion is shown to exist and to have the correct order of magnitude. A 
way to derive the dependence of the chemical threshold on the Ca* 
ion concentration has been given. The degree of depolarization of the 
membrane as calculated here is adequate. 

4. Correct orders of magnitude are found on this basis for the 
resting potential, thresholds, permeability increase, and intensity of 
the electric field strength across the CaP, layer. 

5. A possible way to derive the one-factor theory of excitation 
is indicated. 

6. A method is outlined to obtain in principle the time course 
of the excitatory disturbance and of the local response. 
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7. A way to obtain relaxation oscillations on the basis of a simi- 


lar mechanism is outlined. sind 
8. Derivations of other possible relationships are indicated. 
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